The present invention relates to an osteoclastgenic inhibitory agent comprising an interleukin-18 (hereinafter
abbreviated as "IL-18") or its functional equivalent.

Osteoblasts' bone formation and osteoclasts' bone resorption are well balanced in healthy living bodies, and this
keeps the bone tissues in normal condition while old bone tissues are replaced with fresh ones without altering
the original bone shape. This phenomenon plays an important role in maintaining homeostasis in living bodies, such as
controlling the blood calcium concentration within a desired range. Once the balance is lost, especially when the bone
resorption level exceeds the bone formation level, bone-related diseases and other diseases may be induced. Therefore,
elucidation of the whole mechanism of bone resorption in living bodies, particularly elucidation of osteoclasts, is
greatly highlighted due to its scientific and clinical significance.

However, the mechanism of osteoclast formation has not yet been completely elucidated, even though interleukin 1
as a promoter and interleukin 4 as an inhibitor have been found. This is because, like various other phenomena in living
bodies, osteoclast formation in living bodies is controlled by a close and complicated relationship between promoters
and inhibitors. For these reasons, the establishment of an effective osteoclastgenic inhibitory agent is greatly
expected from both scientific and clinical viewpoints.

The object of the present invention is to provide a novel and effective osteoclastgenic inhibitory agent. To attain
the object, the present inventors energetically studied IL-18, i.e., one of the cytokines that mediate communication
in immune systems, which induces production of interferon-γ (hereinafter abbreviated as "IFN-γ"), an important
biologically active substance for immunocompetent cells, and of granulocyte/macrophage colony-stimulating factor
(hereinafter abbreviated as "GM-CSF"), augments cytotoxicity, and induces formation of killer cells. When first
found, IL-18 was described as an interferon-γ-inducing factor, as reported by Haruki OKAMURA in Japanese Patent
Kokai Nos. 27,189/96 and 193,098/96 and in Nature, Vol. 378, No. 6,552, pp. 88-91 (1995), and was then named IL-18
according to the proposal of Shimpei USHIO et al. in The Journal of Immunology, Vol. 156, pp. 4,274-4,279 (1996).

The present inventors found that a particular gene, capable of inhibiting osteoclast formation from osteoclastic
precursor cells in vitro, is specifically expressed in quantity in stroma cells derived from mouse myeloma. Their further
detailed analysis revealed that (i) the gene encodes IL-18 that includes SEQ ID NO: 7 as a core sequence, (ii) IL-18
and functional equivalents thereof effectively inhibit osteoclast formation, and (iii) the inhibition is mainly due to the
action of GM-CSF induced and produced by IL-18.

Based on these findings, the present inventors attained the present object by an osteoclastgenic inhibitory agent
comprising IL-18 or its functional equivalent as an effective ingredient.

FIG. 1 shows the structure of the recombinant DNA pKGFHH2. 

FIG. 2 shows the structure of the recombinant DNA pCSHIGIF/MUT35.

FIG. 3 shows the structure of the recombinant DNA pCSHIGIF/MUT42.

FIG. 4 shows the structure of the recombinant DNA pBGHuGF.

FIG. 5 shows the structure of the recombinant DNA pKGFMH2.

In these figures, KGFHH2 cDNA means a cDNA encoding the IL-18 according to the present invention; IGIF/MUT35,
a DNA encoding the IL-18 according to the present invention; IGIF/MUT42, a DNA encoding the IL-18 according
to the present invention; HuIGIF, a chromosomal DNA encoding the IL-18 according to the present invention;
KGFMH2 cDNA, a cDNA encoding the IL-18 according to the present invention; 5S, a gene for 5S ribosomal RNA;
Ptac, a Tac promoter; rrnBT1T2, a termination region of a ribosomal RNA operon; AmpR, an ampicillin resistance gene;
pBR322ori, a replication origin of Escherichia coli; CMV, a cytomegalovirus promoter; and IFNss, a nucleotide sequence
encoding a signal peptide for subtype α2b of human interferon-α.

The present invention relates to an osteoclastgenic inhibitory agent comprising IL-18 or its functional equivalent
as an effective ingredient. The wording "IL-18" as referred to in the invention includes polypeptides with the above
property independently of their sources and origins. For example, the IL-18 used in the present invention includes, as
internal partial amino acid sequences, the amino acid sequences of SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3,
as well as SEQ ID NO: 4 and SEQ ID NO: 5, and includes the amino acid sequence of SEQ ID NO: 6 or SEQ ID NO: 7
as a whole. The wording "functional equivalent(s)" as referred to in the present invention includes (i) those
wherein one or more amino acids in the amino acid sequence of IL-18 are replaced with different amino acids, (ii) those
wherein one or more amino acids are added to the N- and/or C-termini of the amino acid sequence of IL-18, (iii) those
wherein one or more amino acids are inserted into the internal sites of the amino acid sequence of IL-18, (iv) those
wherein one or more amino acids in the N- and/or C-terminal regions of the amino acid sequence of IL-18 are deleted,
and (v) those wherein one or more amino acids in the internal regions of the amino acid sequence of IL-18 are deleted;
all of these modifications should be made within a range that does not substantially impair the property of inhibiting
osteoclast formation among the inherent properties of IL-18. Examples of such functional equivalents are described,
along with their detailed amino acid sequences, in Japanese Patent Application No. 20,906/97 by the same applicant as
the present application, i.e., polypeptides which are capable of inducing production of interferon-gamma by
immunocompetent cells, wherein said polypeptides contain either an amino acid sequence wherein one or more cysteines
are replaced with different amino acid(s) while leaving the respective consensus sequences shown in SEQ ID NOs: 1, 2
and 4 intact, or one wherein one or more amino acids are added, removed and/or replaced at one or more sites including
those in the consensus sequences but excluding those of the replaced cysteine(s). The different amino acids to replace
the cysteine(s) are not restricted to any types, as far as the resulting polypeptide, containing an amino acid sequence
replaced with the different amino acid(s), exhibits an activity of inducing production of IFN-γ by immunocompetent cells
in the presence or absence of an appropriate cofactor, as do the wild-type polypeptides containing SEQ ID NOs: 1, 2 and
4 as consensus partial amino acid sequences, and a stability significantly higher than that of the wild-type polypeptides.
The different amino acids include serine, threonine, alanine, valine, leucine, isoleucine, histidine, tyrosine,
phenylalanine, tryptophan, and methionine, among which the most preferable amino acid is serine or alanine. Embodiments
of the amino acid sequences, containing SEQ ID NOs: 1, 2 and 4 as consensus partial amino acid sequences, in which
one or more cysteines are to be replaced with different amino acid(s) are the wild-type polypeptides containing SEQ
ID NO: 6 or 7. SEQ ID NO: 6 contains cysteines at the 38th, 68th, 76th, and 127th positions from the N-terminus. SEQ
ID NO: 7 contains cysteines at the 7th, 75th, and 125th positions. The polypeptides include those containing the amino
acid sequence of any one of SEQ ID NOs: 20-26, which are derived from the wild-type polypeptide containing SEQ ID
NO: 6; those containing the amino acid sequence of SEQ ID NO: 27 or 28, which are derived from the wild-type
polypeptide containing the amino acid sequence of SEQ ID NO: 7; and those containing an amino acid sequence
derived from any one of SEQ ID NOs: 20-28 by adding, removing, and/or replacing one or more amino acids at
position(s) other than those where the cysteine(s) have been replaced, while retaining the desired biological
activities and stability. The wording "one or more amino acids" means the number of amino acids which conventional
methods such as site-directed mutagenesis can usually add, remove or replace. The polypeptides containing any one
of SEQ ID NOs: 20-28 possess both stability and biological activities significantly higher than those of the wild-type
polypeptides.

EP 0 861 663 A2

The functional equivalents as referred to in the present invention further include glycosylated polypeptides of IL-18
and of the above polypeptides. Any of these IL-18 and functional equivalents thereof, both of which are included in
and referred to as "IL-18" in the present invention unless specified otherwise, can be used in the present invention
independently of their origins: those prepared by separation from natural sources such as cell cultures, and those
artificially synthesized using recombinant DNA technology and peptide synthesis.

From an economical viewpoint, methods of recombinant DNA technology are advantageously used: generally, the desired
IL-18 can be obtained by introducing DNAs encoding IL-18 into appropriate hosts derived from microorganisms, plants,
and animals to form transformants, culturing the transformants in nutrient culture media in a conventional manner, and
purifying the cultures by conventional methods used for purifying cytokines. Any DNAs can be used as the above DNAs
as long as they contain a DNA encoding IL-18, and can be suitably selected depending on the purpose of the use of
the present osteoclastgenic inhibitory agent or on the recombinant DNA technology used. For example, Japanese
Patent Kokai Nos. 193,098/96, 231,598/96, and 27,189/96 by the same applicant as the present invention disclose in
detail methods for producing IL-18 by culturing transformed microorganisms into which DNAs including a cDNA
encoding mouse or human IL-18 are introduced; and Japanese Patent Application No. 185,305/96 by the same applicant
as the present invention discloses in detail a method for producing human IL-18 by culturing transformed
animal cells which have an introduced DNA that includes a chromosomal DNA encoding human IL-18. Japanese Patent
Application No. 20,906/97 by the same applicant as the present invention discloses in detail a method for producing
IL-18 by culturing transformed animal cells having an introduced DNA which includes a DNA encoding a functional
equivalent of human IL-18.

The aforesaid recombinant DNA technology has an economical advantage, but depending on the hosts and DNA
sequences used, the IL-18 thus obtained may have physicochemical properties somewhat different from those of the IL-18
that is produced and functions in vivo. Japanese Patent Application No. 67,434/96 by the same applicant as the present
invention discloses in detail a preparation of IL-18 using established human cell lines as natural sources, and Japanese
Patent Application No. 213,267/96 by the same applicant also discloses in detail a preparation using an
interleukin-1β-converting enzyme. The IL-18 obtained by those preparations can be estimated to have substantially the
same physicochemical properties as those of the IL-18 that is produced and functions in vivo, and the yield can be
estimated to be slightly lower. However, such IL-18 has the advantage that it has fewer side effects when used as a
pharmaceutical to be administered to warm-blooded animals in general, including humans. When purification methods
using monoclonal antibodies specific to IL-18 are applied, as disclosed in Japanese Patent Application No. 231,598/96
by the same applicant as the present invention, an IL-18 of relatively high purity can be obtained with minimum labor
and cost.

The present osteoclastgenic inhibitory agent comprising the aforesaid IL-18 includes any types and forms usable
to inhibit osteoclast formation both in vivo and in vitro. The present agent can be advantageously used as ingredients
for cell culture media for animal cells, which satisfactorily inhibit osteoclast formation and maintain, proliferate,
and/or differentiate the desired cells; components of screening kits for bone-related therapeutic agents;
bone-resorption regulatory agents; and agents for osteoclast-related diseases. The bone-resorption regulatory agents
include medicaments and health foods that exert an osteoclastgenic inhibitory activity in vivo, control bone resorption
to normal conditions, and improve unfavorable physical conditions such as a relatively insignificant arthralgia. The
agents for osteoclast-related diseases include medicaments used to prevent and/or treat diseases caused by excessive
osteoclast formation and/or function. Examples of such diseases are hypercalcemia, osteoclastoma, Behcet's syndrome,
osteosarcoma, arthropathy, chronic rheumatoid arthritis, deformity ostitis, primary hyperthyroidism, osteopenia, and
osteoporosis. Varying depending on the types of agents and diseases to be treated, the present agent is usually
formulated into a liquid, paste, or solid form which contains 0.000002-100 w/w %, preferably 0.0002-0.5 w/w %, of IL-18.
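
The content ranges above can be turned into absolute amounts with simple arithmetic. The following is a minimal sketch, not part of the original disclosure: the function names `il18_mass_mg` and `is_preferred` are hypothetical, and the only data taken from the text are the two w/w % ranges.

```python
# IL-18 content ranges quoted in the text, in w/w %.
BROAD_RANGE = (0.000002, 100.0)
PREFERRED_RANGE = (0.0002, 0.5)


def il18_mass_mg(formulation_mass_g: float, w_w_percent: float) -> float:
    """Return the mass of IL-18 (mg) in a formulation of the given mass (g).

    w/w % is grams of IL-18 per 100 g of formulation.
    """
    if not BROAD_RANGE[0] <= w_w_percent <= BROAD_RANGE[1]:
        raise ValueError("content outside the 0.000002-100 w/w % range")
    return formulation_mass_g * 1000.0 * w_w_percent / 100.0


def is_preferred(w_w_percent: float) -> bool:
    """True if the content lies within the preferred 0.0002-0.5 w/w % range."""
    return PREFERRED_RANGE[0] <= w_w_percent <= PREFERRED_RANGE[1]


if __name__ == "__main__":
    for percent in PREFERRED_RANGE:
        print(f"{percent} w/w % -> {il18_mass_mg(1.0, percent):.4f} mg IL-18 per g")
```

Under these assumptions, the preferred range corresponds to roughly 0.002 mg to 5 mg of IL-18 per gram of formulation.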

The present osteoclastgenic inhibitory agent can be IL-18 alone or compositions comprising IL-18 and one or more
other ingredients such as carriers, excipients, diluents, adjuvants, antibiotics, and proteins such as serum albumin and
gelatin as stabilizers; saccharides such as glucose, maltose, maltotriose, maltotetraose, trehalose, sucrose,
isomaltose, lactose, panose, erlose, palatinose, lactosucrose, raffinose, fructooligosaccharide, galactooligosaccharide,
lentinan, dextrin, pullulan, and sugar alcohols including sorbitol, maltitol, lactitol, and maltotriitol; buffers
mainly comprising phosphates or citrates; reductants such as 2-mercaptoethanol, dithiothreitol, and reduced glutathione;
and optionally biologically active substances such as interferon-α, interferon-β, interferon-γ, interleukin-2,
interleukin-3, interleukin-6, interleukin-12, TNF-α, TNF-β, GM-CSF, estrogen, progesterone, chlormadinone acetate,
calcitonin, somatokine, somatomedin, insulin-like growth factor, ipriflavone, parathyroid hormone (PTH), norethisterone,
busulfan, ancitabine, cytarabine, fluorouracil, tetrahydrofurfuryl fluorouracil, methotrexate, vitamin D2, active
vitamin D, Krestin® or polysaccharide K, L-asparaginase, and OK-432 or Picibanil®; and calcium salts such as calcium
lactate, calcium chloride, calcium monohydrogenphosphate, and L-calcium L-aspartate. When used as agents for
administration to warm-blooded animals in general, including humans, i.e., agents for osteoclast-related diseases, the
present agent can be preferably formulated into compositions by appropriately combining it with one or more of the above
physiologically acceptable substances.

The present osteoclastgenic inhibitory agent includes medicaments in a unit dose form used for administration to
warm-blooded animals in general, including humans. The wording "unit dose form" means those which contain
IL-18 in an amount suitable for a daily dose, or in an amount of an integral multiple (up to fourfold) or a fraction
(down to 1/40) of the dose, and those in a physically separated and formulated form suitable for prescribed
administrations. Examples of such formulations are injections, liquids, powders, granules, tablets, capsules, troches,
collyriums, nebulas, and suppositories.

The present agent as an osteoclastgenic inhibitory agent effectively treats and prevents osteoclast-related diseases
whether it is administered orally or parenterally. Varying depending on the types and symptoms of patients' diseases,
the present agent can be administered to the patients orally, intradermally, subcutaneously, intramuscularly, or
intravenously at a dose of about 0.5 µg to 100 mg per shot, preferably about 2 µg to 10 mg per shot, of IL-18,
2-6 times a day or 2-10 times a week, for one day to one year.

Below, with reference to experiments, the preparation, physicochemical properties, and biological activity of
the IL-18 according to the present invention are described:

Experiment 1 

Preparation of human IL-18 

According to the method in Japanese Patent Kokai No. 231,598/96 by the same applicant as the present invention,
an autonomously-replicable recombinant DNA, pKGFHH2, linked to a cDNA encoding human IL-18, was prepared.
Dideoxyribonucleotide sequencing revealed that, as shown in FIG. 1, in the recombinant DNA, KGFHH2 cDNA
containing the base sequence of SEQ ID NO: 8 was linked to the downstream of Ptac, a Tac promoter. The recombinant
DNA pKGFHH2 encoded the amino acid sequences of SEQ ID NOs: 1 to 5; these amino acid sequences were
respectively encoded by nucleotides 46-63, 88-105, 400-420, 151-165, and 214-228 in SEQ ID NO: 8.
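
As a quick consistency check on the nucleotide ranges quoted above, each range should span a whole number of codons. The sketch below is illustrative only: the dictionary name `SEGMENTS` and the helper `codon_count` are hypothetical, and the only inputs are the five ranges stated in the text.

```python
# Nucleotide ranges in SEQ ID NO: 8 quoted in the text, keyed by the SEQ ID NO
# of the partial amino acid sequence each range is said to encode.
SEGMENTS = {
    1: (46, 63),
    2: (88, 105),
    3: (400, 420),
    4: (151, 165),
    5: (214, 228),
}


def codon_count(start: int, end: int) -> int:
    """Number of codons in an inclusive, 1-based nucleotide range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a whole number of codons")
    return length // 3


if __name__ == "__main__":
    for seq_id, (start, end) in SEGMENTS.items():
        print(f"SEQ ID NO: {seq_id}: nucleotides {start}-{end} "
              f"-> {codon_count(start, end)} amino acids")
```

All five ranges are multiples of three nucleotides, which is what one would expect if each range encodes a contiguous partial amino acid sequence.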

According to the method in Japanese Patent Kokai No. 231,598/96, the recombinant DNA pKGFHH2 was introduced
into an Escherichia coli Y1090 strain, ATCC 37197, and the strain was cultured. The produced polypeptide was
purified by immunoaffinity chromatography to obtain a purified human IL-18 with a purity of at least 95% in a yield of
about 25 mg/l culture. According to the method in Japanese Patent Kokai No. 193,098/96 by the same applicant as
the present invention, the purified human IL-18 was analyzed for biological activity and physicochemical properties as
indicated below: When human lymphocytes, collected in a conventional manner from a healthy donor, were cultured in the
presence of the purified human IL-18, IFN-γ production was observed depending on the concentration of IL-18,
confirming that IL-18 has an activity of inducing IFN-γ production by lymphocytes as immunocompetent cells.
In accordance with the method reported by U. K. Laemmli in Nature, Vol. 227, pp. 680-685 (1970), the purified IL-18
was subjected to SDS-PAGE, resulting in a major band with an IFN-γ inducing activity at a position corresponding
to 18,500±3,000 daltons. The IL-18 gave a pI of 4.9±1.0 as determined by conventional chromatofocusing.
Conventional analysis using "PROTEIN SEQUENCER MODEL 473A", an apparatus of Applied Biosystems, Inc., Foster City,
USA, revealed that the IL-18 had the amino acid sequence of SEQ ID NO: 9, i.e., the amino acid sequence of SEQ ID
NO: 8 where a methionine residue was linked to the N-terminus.

Experiment 2 

Preparation of human IL-18 

According to the method in Japanese Patent Application No. 67,434/96 by the same applicant as the present
invention, THP-1 cells, ATCC TIB 202, a human monocyte cell line derived from a male with acute monocytic leukemia,
were inoculated to the dorsum subcutaneous tissues of newborn hamsters, followed by feeding the hamsters for three
weeks. Tumor masses, about 15 g in weight each, formed in the subcutaneous tissues of each hamster, were extracted,
dispersed in media, and disrupted. The polypeptide obtained from the disrupted cells was purified by immunoaffinity
chromatography to obtain a purified human IL-18 in a yield of about 50 ng/head.

Similarly, according to the method in Japanese Patent Application No. 67,434/96, the purified human IL-18 was
analyzed for biological activity and physicochemical properties as indicated below: It was confirmed that
culturing human lymphocytes, collected from healthy donors in a conventional manner, in the presence of different
concentrations of the human IL-18 resulted in an IL-18 dose-dependent IFN-γ production. This revealed that the human
IL-18 has a biological activity of inducing IFN-γ production by lymphocytes as immunocompetent cells. In accordance
with the method reported by U. K. Laemmli in Nature, Vol. 227, pp. 680-685 (1970), the purified human IL-18 was
subjected to SDS-PAGE using 2 w/v % dithiothreitol as a reductant, resulting in a major band with an IFN-γ production
inducing activity at a position corresponding to 18,000-19,500 daltons. According to the peptide map disclosed in
Japanese Patent Application No. 67,434/96, the human IL-18 was treated with clostripain commercialized by Sigma
Chemical Company, Missouri, USA, to obtain polypeptide fragments, followed by fractionating the fragments by
high-performance liquid chromatography (HPLC) using "ODS-120T", a column commercialized by Tosoh Corporation,
Tokyo, Japan, and analyzing the amino acid sequences of the fragments from the N-terminus to reveal the
amino acid sequences of SEQ ID NOs: 10 to 13. These amino acid sequences completely coincided
with amino acids 148-157, 1-13, 45-58, and 80-96 in SEQ ID NO: 6. The data show that the human IL-18 obtained
in Experiment 2 has the amino acid sequence of SEQ ID NO: 6 and all the partial amino acid sequences of SEQ ID
NOs: 1 to 5.

Experiment 3 

Preparation of functional equivalents 

According to the method in Japanese Patent Application No. 20,906/97 by the same applicant as the present
invention, an autonomously-replicable recombinant DNA, pCSHIGIF/MUT35, was prepared, which was linked to a DNA
encoding a functional equivalent of human IL-18 where cysteines 38, 68, and 76 in SEQ ID NO: 6 were respectively
replaced with serine, serine, and alanine. Dideoxyribonucleotide sequence analysis revealed that, as shown in FIG. 2,
in the recombinant DNA, DNA IGIF/MUT35 with SEQ ID NO: 14 was linked in the same reading frame to the downstream
of a base sequence encoding a signal peptide of subtype α2b of human interferon-α, as reported by K. Henco et al. in
Journal of Molecular Biology, Vol. 185, pp. 227-260 (1985), and had a stop codon for protein synthesis further
downstream. As shown in parallel in SEQ ID NO: 14, the amino acid sequence encoded by the recombinant DNA
corresponded to SEQ ID NO: 6 where cysteines 38, 68, and 76 in SEQ ID NO: 6 were respectively replaced with serine,
serine, and alanine. The recombinant DNA contained a nucleotide sequence which encodes all the amino acid sequences of
SEQ ID NOs: 1 to 4 and that of SEQ ID NO: 5 where the cysteine at amino acid 5 in SEQ ID NO: 5 was replaced with
alanine. These amino acid sequences were respectively encoded by nucleotides 46-63, 88-105, 400-420, 151-165,
and 214-228 in SEQ ID NO: 14.
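
The relation between the substituted cysteines and the partial sequence of SEQ ID NO: 5 can be cross-checked by converting nucleotide positions to codon positions. This is a sketch under a stated assumption, namely that nucleotide 1 of SEQ ID NO: 14 is the first base of the codon for residue 1 of SEQ ID NO: 6; the helper names are hypothetical.

```python
# Positions quoted in the text.
SEQ5_NUCLEOTIDES = (214, 228)         # range said to encode SEQ ID NO: 5
SUBSTITUTED_CYSTEINES = (38, 68, 76)  # residues of SEQ ID NO: 6 replaced in MUT35


def codon_index(nucleotide: int) -> int:
    """1-based codon number containing a 1-based nucleotide position.

    Assumes nucleotide 1 is the first base of codon 1 (an assumption,
    not something the text states explicitly).
    """
    return (nucleotide - 1) // 3 + 1


def residue_span(start: int, end: int) -> tuple[int, int]:
    """First and last residue numbers encoded by an inclusive nucleotide range."""
    return codon_index(start), codon_index(end)


if __name__ == "__main__":
    first, last = residue_span(*SEQ5_NUCLEOTIDES)
    print(f"SEQ ID NO: 5 spans residues {first}-{last}")
    for cys in SUBSTITUTED_CYSTEINES:
        if first <= cys <= last:
            print(f"cysteine {cys} is amino acid {cys - first + 1} of SEQ ID NO: 5")
```

Under that assumption, nucleotides 214-228 correspond to residues 72-76, so cysteine 76 falls at amino acid 5 of SEQ ID NO: 5, which agrees with the statement that the cysteine at amino acid 5 of SEQ ID NO: 5 was replaced with alanine.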

According to the method in Japanese Patent Application No. 20,906/97 by the same applicant as the present
invention, the recombinant DNA pCSHIGIF/MUT35 was introduced into COS-1 cells, ATCC CRL 1650, an established
cell line derived from SV40-transformed African green monkey kidney, followed by culturing the transformed cells. The
polypeptide produced in the culture was purified by immunoaffinity chromatography to obtain a purified functional
equivalent of human IL-18 in a yield of about 40 ng/ml culture. According to the method in Japanese Patent Application
No. 20,906/97, the purified functional equivalent was analyzed for biological activity and physicochemical
properties as indicated below: When KG-1 cells, ATCC CCL 246, an established cell line derived from human
acute myelogenous leukemia, were cultured in the presence of different concentrations of the purified functional
equivalent of human IL-18, IFN-γ production was observed depending on the concentration of the functional equivalent,
revealing that it has a biological activity of inducing IFN-γ production by KG-1 cells as immunocompetent cells. In
accordance with the method reported by U. K. Laemmli in Nature, Vol. 227, pp. 680-685 (1970), the purified functional
equivalent was subjected to SDS-PAGE in the presence of 2 w/v % dithiothreitol as a reductant, resulting in a major
band with an IFN-γ production inducing activity at a position corresponding to 18,000-19,500 daltons. Conventional
analysis using "PROTEIN SEQUENCER MODEL 473A", an apparatus of Applied Biosystems, Inc., Foster City, USA, revealed
that the N-terminal region of the functional equivalent had the amino acid sequence of SEQ ID NO: 15, which corresponded
to the amino acid sequence in the N-terminal region as shown in parallel in SEQ ID NO: 14.

Experiment 4 

Preparation of functional equivalent 

According to the method in Japanese Patent Application No. 20,906/97 by the same applicant as the present
invention, an autonomously-replicable recombinant DNA, pCSHIGIF/MUT42, was prepared, which was linked to a
DNA encoding a functional equivalent of human IL-18 where cysteines 38, 68, 76, and 127 in SEQ ID NO: 6 were
respectively replaced with serine, serine, alanine, and serine. Dideoxyribonucleotide sequencing revealed that, as
shown in FIG. 3, in the recombinant DNA, DNA IGIF/MUT42 with SEQ ID NO: 16 was linked in the same reading frame
to the downstream of a base sequence encoding a signal peptide for subtype α2b of human interferon-α, as reported by
K. Henco et al. in Journal of Molecular Biology, Vol. 185, pp. 227-260 (1985), and had a stop codon for protein
synthesis further downstream. As shown in parallel in SEQ ID NO: 16, the amino acid sequence encoded by the recombinant
DNA corresponded to SEQ ID NO: 6 where cysteines 38, 68, 76, and 127 in SEQ ID NO: 6 were respectively replaced
with serine, serine, alanine, and serine. The recombinant DNA contained a nucleotide sequence which encodes all
the amino acid sequences of SEQ ID NOs: 1 to 4 and that of SEQ ID NO: 5 where cysteine 5 in SEQ ID NO: 5 was
replaced with alanine. These amino acid sequences were respectively encoded by nucleotides 46-63, 88-105, 400-420,
151-165, and 214-228 in SEQ ID NO: 16.

According to the method in Japanese Patent Application No. 20,906/97 by the same applicant as the present
invention, the recombinant DNA pCSHIGIF/MUT42 was introduced into COS-1 cells, followed by culturing the cells.
The polypeptide produced in the culture was purified by immunoaffinity chromatography to obtain a purified functional
equivalent of human IL-18 in a yield of about 20 ng/ml culture. According to the method in Japanese Patent Application
No. 20,906/97, the purified functional equivalent was analyzed for biological activity and physicochemical
properties as indicated below: When KG-1 cells were cultured in the presence of different concentrations of the purified
functional equivalent, a dose-dependent IFN-γ production was observed, and this revealed that the functional equivalent
has a biological activity of inducing IFN-γ production by KG-1 cells as immunocompetent cells. In accordance with
the method reported by U. K. Laemmli in Nature, Vol. 227, pp. 680-685 (1970), the purified functional equivalent
was subjected to SDS-PAGE in the presence of 2 w/v % dithiothreitol as a reductant, resulting in a major band with an
IFN-γ inducing activity at a position corresponding to 18,000-19,500 daltons. Conventional analysis using "PROTEIN
SEQUENCER MODEL 473A", an apparatus of Applied Biosystems, Inc., Foster City, USA, revealed that the N-terminal
region of the functional equivalent had the amino acid sequence of SEQ ID NO: 15, which completely corresponded to
the amino acid sequence in the N-terminal region as shown in parallel in SEQ ID NO: 16.

Experiment 5 

Preparation of human IL-18 

According to the method in Japanese Patent Application No. 185,305/96 by the same applicant as the present
invention, an autonomously-replicable recombinant DNA, pBGHuGF, linked to a chromosomal DNA encoding human
IL-18, was obtained. Dideoxyribonucleotide sequencing analysis revealed that, as shown in FIG. 4, in the recombinant
DNA, a chromosomal DNA which encodes human IL-18, i.e., DNA HuIGIF with SEQ ID NO: 17, was linked to the
downstream of a restriction site for the restriction enzyme HindIII. As shown in SEQ ID NO: 17, the chromosomal DNA
HuIGIF consists of 11,464 bp where the exon was fragmented by four introns positioned at nucleotides 83-1,453,
1,466-4,848, 4,984-6,317, and 6,452-11,224. In the remaining nucleotide sequence excluding these introns,
nucleotides 3-11,443 from the 5'-terminus are the part that encodes a precursor of human IL-18, and nucleotides
4,866-4,983 are the part that encodes an active human IL-18. The chromosomal DNA contained nucleotide sequences encoding
SEQ ID NOs: 1 to 5: these amino acid sequences were respectively encoded by nucleotides 4,911-4,928, 4,953-4,970,
11,372-11,392, 6,350-6,364, and 6,413-6,427 in SEQ ID NO: 17.
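
The exon segments implied by the intron coordinates can be derived mechanically. The sketch below is illustrative and makes two assumptions: the garbled second intron is read as 1,466-4,848 and the third partial-sequence range as 11,372-11,392; the function names are hypothetical.

```python
GENE_LENGTH = 11_464  # length of the chromosomal DNA HuIGIF in bp, from the text

# Intron positions (1-based, inclusive) as read from the text.
INTRONS = [(83, 1_453), (1_466, 4_848), (4_984, 6_317), (6_452, 11_224)]

# Ranges said to encode SEQ ID NOs: 1 to 5, in that order.
PARTIAL_SEQUENCES = [
    (4_911, 4_928), (4_953, 4_970), (11_372, 11_392),
    (6_350, 6_364), (6_413, 6_427),
]


def exons_from_introns(length, introns):
    """Return the inclusive segments of 1..length not covered by any intron."""
    exons, cursor = [], 1
    for start, end in sorted(introns):
        if start > cursor:
            exons.append((cursor, start - 1))
        cursor = end + 1
    if cursor <= length:
        exons.append((cursor, length))
    return exons


def lies_in_exon(segment, exons):
    """True if the segment is fully contained in a single exon."""
    return any(lo <= segment[0] and segment[1] <= hi for lo, hi in exons)


if __name__ == "__main__":
    exons = exons_from_introns(GENE_LENGTH, INTRONS)
    print(exons)
    print(all(lies_in_exon(seg, exons) for seg in PARTIAL_SEQUENCES))
```

With these readings, the four introns leave five exon segments, and each of the five quoted coding ranges falls entirely within one of them.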

According to the method in Japanese Patent Application No. 185,305/96, the recombinant DNA pBGHuGF was
introduced into CHO-K1 cells, ATCC CCL 61, an established cell line derived from Chinese hamster ovary, followed
by culturing the cells. The culture supernatant was contacted with a supernatant of a cell disruptant prepared from a
THP-1 cell culture to produce a polypeptide, which was then purified by immunoaffinity chromatography to obtain a
purified human IL-18 in a yield of about 15 mg/l culture. According to the method in Japanese Patent Application No.
185,305/96, the polypeptide was analyzed for biological activity and physicochemical properties as
indicated below: It was confirmed that human lymphocytes, which were collected from a healthy donor, produced IFN-γ
depending on the purified human IL-18 concentration when cultured at different concentrations of the human IL-18,
revealing that the human IL-18 has a biological activity of inducing IFN-γ production by lymphocytes as
immunocompetent cells. In accordance with the method reported by U. K. Laemmli in Nature, Vol. 227, pp. 680-685 (1970),
the purified human IL-18 was subjected to SDS-PAGE in the presence of 2 w/v % dithiothreitol as a reductant, resulting
in a major band with an IFN-γ inducing activity at a position corresponding to 18,000-19,500 daltons. The N-terminal
region of the human IL-18 contained the amino acid sequence of SEQ ID NO: 15, which completely corresponded to
the amino acid sequence in the N-terminal region of SEQ ID NO: 17 for an active IL-18.

Experiment 6

Preparation of mouse IL-18 

To a 0.5-ml reaction tube were added 8 µl of 25 mM magnesium chloride, 10 µl of 10 x PCR buffer, one µl of 25
mM dNTP mix, one µl of 2.5 units/µl of AmpliTaq DNA polymerase, one ng of a recombinant DNA, which encodes
mouse IL-18 having the nucleotide sequence of SEQ ID NO: 18 and the amino acid sequence of SEQ ID NO: 7,
prepared from a phage DNA clone according to the method in Japanese Patent Kokai No. 27,189/96, and adequate
amounts of sense and antisense primers having the nucleotide sequences represented by
5'-ATAGAATTCAAATGAACTTTGGCCGACTTCACTG-3' and 5'-ATAAAGCTTCTAACTTTGATGTAAGTT-3', respectively, which were
chemically synthesized based on the amino acid sequences near the N- and C-termini of SEQ ID NO: 7, and the
mixture solution was brought up to a volume of 100 µl with sterilized distilled water. The solution thus obtained was
subjected in a usual manner to PCR consisting of three cycles of successive incubations at 94°C for one
minute, 43°C for one minute, and 72°C for one minute, followed by a further 40 cycles of successive incubations at
94°C for one minute, 60°C for one minute, and 72°C for one minute.
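
The reaction mixture and thermal program described above can be summarized numerically. The following is a minimal sketch, not part of the original protocol: it applies the dilution relation C1 x V1 = C2 x V2 to the stated stock volumes and sums the stated incubation times, ignoring ramp times between temperatures. All names are hypothetical.

```python
FINAL_VOLUME_UL = 100.0

# component: (stock concentration, volume added in µl); units are in the key.
STOCKS = {
    "MgCl2 (mM)": (25.0, 8.0),
    "PCR buffer (x)": (10.0, 10.0),
    "dNTP mix (mM)": (25.0, 1.0),
    "DNA polymerase (units/µl)": (2.5, 1.0),
}

# (number of cycles, [(temperature in °C, minutes), ...]) as stated in the text.
PROGRAM = [
    (3, [(94, 1), (43, 1), (72, 1)]),
    (40, [(94, 1), (60, 1), (72, 1)]),
]


def final_concentration(stock, volume_ul, final_volume_ul=FINAL_VOLUME_UL):
    """Concentration after dilution to the final volume (C1*V1 = C2*V2)."""
    return stock * volume_ul / final_volume_ul


def total_incubation_minutes(program):
    """Sum of all hold times; ramping between temperatures is not included."""
    return sum(cycles * sum(minutes for _, minutes in steps)
               for cycles, steps in program)


if __name__ == "__main__":
    for name, (stock, volume) in STOCKS.items():
        print(f"{name}: {final_concentration(stock, volume):g}")
    print(f"total hold time: {total_incubation_minutes(PROGRAM)} min")
```

On these figures the 100 µl reaction contains 2 mM magnesium chloride, 1 x buffer, and 0.25 mM dNTP mix, and the 43 cycles add up to 129 minutes of incubation.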

The product obtained by the PCR and "pCR-Script SK (+)", a plasmid vector commercialized by Stratagene
Cloning Systems, California, USA, were ligated together in a conventional manner using a DNA ligase into a
recombinant DNA, which was then introduced into "XL-1 Blue MRF'Kan", an Escherichia coli strain commercialized
by Stratagene Cloning Systems, California, USA, to obtain a transformant. The transformant was inoculated into L-broth
(pH 7.2) containing 50 µg/ml ampicillin, followed by incubation at 37°C for 18 hours under shaking conditions. The
culture was centrifuged to obtain the proliferated transformants, which were then treated by a conventional alkali-SDS
method to isolate a recombinant DNA. A portion of the isolated recombinant DNA was analyzed by dideoxyribonucleotide
sequencing, revealing that the recombinant DNA contained restriction sites of Eco RI and Hind III at the 5'- and
3'-termini of SEQ ID NO: 18, respectively, and a DNA containing a methionine codon for initiating polypeptide synthesis
and a TAG codon for terminating polypeptide synthesis, which were located just before and after the N- and C-termini,
respectively, of the amino acid sequence shown in parallel in SEQ ID NO: 18. The recombinant DNA contained
nucleotide sequences encoding the amino acid sequences of SEQ ID NOs: 1 to 5. These amino acid sequences were
encoded by nucleotides 46-63, 85-102, 394-414, 148-162, and 211-225 in SEQ ID NO: 18.
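The correspondence between the nucleotide ranges and the peptide fragments can be illustrated with a minimal sketch. It assumes that nucleotide 1 of SEQ ID NO: 18 is the first base of the first codon of the mature polypeptide; the one-letter sequence is transcribed from SEQ ID NO: 7 in the annex, and the helper names are hypothetical.

```python
# Sketch: map in-frame nucleotide ranges to amino acid residues.
# ASSUMPTION: nucleotide 1 is the first base of codon 1 of the mature protein.
# MOUSE_IL18 is SEQ ID NO: 7 of this document in one-letter code.

MOUSE_IL18 = (
    "NFGRLHCTTAVIRNIN" "DQVLFVDKRQPVFEDM" "TDIDQSASEPQTRLII"
    "YMYKDSEVRGLAVTLS" "VKDSKMSTLSCKNKII" "SFEEMDPPENIDDIQS"
    "DLIFFQKRVPGHNKME" "FESSLYEGHFLACQKE" "DDAFKLILKKKDENGD"
    "KSVMFTLTNLHQS"
)


def residues_for(nt_start: int, nt_end: int) -> tuple:
    """Convert a 1-based, in-frame nucleotide range to a 1-based residue range."""
    if (nt_start - 1) % 3 or nt_end % 3:
        raise ValueError("range is not aligned to codon boundaries")
    return (nt_start - 1) // 3 + 1, nt_end // 3


def peptide_for(nt_start: int, nt_end: int, protein: str = MOUSE_IL18) -> str:
    """Return the residues encoded by the given nucleotide range."""
    first, last = residues_for(nt_start, nt_end)
    return protein[first - 1:last]


# Nucleotide ranges quoted in the text for SEQ ID NOs: 1 to 5, in order.
RANGES = [(46, 63), (85, 102), (394, 414), (148, 162), (211, 225)]

for seq_id, (start, end) in enumerate(RANGES, start=1):
    print(f"SEQ ID NO: {seq_id}", residues_for(start, end), peptide_for(start, end))
```

Under this assumption the five ranges fall on residues 16-21, 29-34, 132-138, 50-54, and 71-75, and the extracted peptides agree with the fragments listed as SEQ ID NOs: 1 to 5.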

The remaining portion of the recombinant DNA was cleaved in a conventional manner with the restriction enzymes
Eco RI and Hind III. Using "DNA LIGATION KIT VER 2", a DNA ligation kit commercialized by Takara Shuzo Co., Ltd.,
Tokyo, Japan, 0.1 µg of the resulting Eco RI-Hind III DNA fragment and 10 ng of pKK223-3, a
plasmid vector commercialized by Pharmacia LKB Biotechnology AB, Uppsala, Sweden, which had been cleaved with
a restriction enzyme, were linked together by incubating at 16°C for 30 min to obtain an autonomously-replicable
recombinant DNA, pKGFMH2. Using the competent cell method, an Escherichia coli Y1090 strain, ATCC 37197, was
transformed with the recombinant DNA pKGFMH2, and the resulting transformant, KGFMH2, was inoculated into
L-broth (pH 7.2) containing 50 µg/ml ampicillin and cultured at 37°C for 18 hours under shaking conditions. The culture
was centrifuged to collect the proliferated transformants, followed by applying a conventional SDS-alkali method to a
portion of the transformants to extract the recombinant DNA pKGFMH2. Dideoxyribonucleotide sequencing analysis
revealed that, as shown in FIG. 5, KGFMH2 cDNA containing the nucleotide sequence of SEQ ID NO: 18 was linked
downstream of the Tac promoter in the recombinant DNA pKGFMH2.

Ampicillin was added to L-broth (pH 7.2), which had been sterilized by autoclaving, to give a concentration of 50
µg/ml, and the medium was cooled to 37°C and inoculated with the transformant KGFMH2, followed by culture at 37°C
for 18 hours. Eighteen liters of a fresh preparation of the same culture medium was placed in a 20-l jar fermenter,
sterilized similarly as above, admixed with ampicillin, cooled to 37°C, and inoculated with one v/v % of the seed culture
obtained above, followed by culture at 37°C for 8 hours under aeration-agitation conditions. The resulting culture was
centrifuged to collect the cultured cells, which were then suspended in a mixture solution (pH 7.3) containing 150 mM
sodium chloride, 16 mM disodium hydrogenphosphate, and 4 mM sodium dihydrogenphosphate, disrupted by
ultrasonication, and centrifuged to remove the cell debris, yielding about two liters of a supernatant.

To about two liters of the supernatant was added 10 mM phosphate buffer (pH 7.3) containing ammonium
sulfate to give 40% saturation with ammonium sulfate. The resulting sediment was removed by centrifugation, and the
supernatant was mixed with ammonium sulfate to give 85% saturation, allowed to stand at 4°C for 18 hours, and
centrifuged at about 8,000 rpm for 30 min to obtain a newly formed sediment. The sediment thus obtained was dissolved
in 10 mM phosphate buffer (pH 6.6) containing 1.5 M ammonium sulfate to give a total volume of about 1,300 ml, and
the solution was filtered and fed to a column packed with about 800 ml of "PHENYL SEPHAROSE CL-6B", a gel
commercialized by Pharmacia LKB Biotechnology AB, Uppsala, Sweden, followed by washing the column with a fresh
preparation of the same buffer and feeding to the column a linear gradient of ammonium sulfate decreasing from
1.5 M to 0 M in 10 mM phosphate buffer (pH 6.6) at an SV (space velocity) of 1.5. Fractions eluted at around 1 M
ammonium sulfate were pooled, concentrated using a membrane filter, and dialyzed against 10 mM phosphate buffer
(pH 6.5) at 4°C for 18 hours. The dialyzed solution was fed to a column packed with about 55 ml of "DEAE-5PW", a
gel commercialized by Pharmacia LKB Biotechnology AB, Uppsala, Sweden, which had been equilibrated with 10 mM
phosphate buffer (pH 6.5). The column was washed with a fresh preparation of the same buffer and fed with a linear
gradient of sodium chloride increasing from 0 M to 0.5 M in 10 mM phosphate buffer (pH 6.5) at an SV of 5.5, followed
by collecting fractions eluted at around 0.2 M sodium chloride. Thereafter, the fractions were pooled and concentrated
similarly as above to about nine milliliters, followed by dialyzing the concentrate against PBS (phosphate
buffered saline) at 4°C for 18 hours and feeding the dialyzed solution to a column packed with "SUPERDEX 75", a
gel commercialized by Pharmacia LKB Biotechnology AB, Uppsala, Sweden, which had been equilibrated with a fresh
preparation of the same PBS. The column was fed with a fresh preparation of the same PBS to collect fractions with
an IFN-γ inducing activity, and the fractions were pooled and concentrated with a membrane filter to obtain a purified
mouse IL-18 in a yield of about 350 µg/l of culture.
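The column conditions above can be turned into flow rates with a small sketch. It assumes that "SV" denotes bed volumes of liquid passed per hour and that the stated yield refers to the 18-liter fermenter culture; neither point is spelled out in the text, and the names are illustrative.

```python
# Sketch: flow rates implied by the stated space velocities, plus total yield.
# ASSUMPTION: "SV" means bed volumes of liquid passed through the column per hour.


def flow_rate_ml_per_h(bed_volume_ml: float, sv_per_h: float) -> float:
    """Volumetric flow rate for a packed bed run at a given space velocity."""
    return bed_volume_ml * sv_per_h


phenyl_flow = flow_rate_ml_per_h(800.0, 1.5)  # hydrophobic column, about 800 ml bed
deae_flow = flow_rate_ml_per_h(55.0, 5.5)     # ion-exchange column, about 55 ml bed

# ASSUMPTION: about 350 ug per liter applies to the 18-l culture.
total_yield_mg = 350.0 * 18.0 / 1000.0

print(phenyl_flow, "ml/h;", deae_flow, "ml/h")
print(total_yield_mg, "mg of purified mouse IL-18")
```

Under these assumptions the two gradient steps run at roughly 1.2 l/h and 0.3 l/h, and one fermenter batch gives a few milligrams of purified protein.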

According to the method in Japanese Patent Kokai No. 27,189/96, the purified mouse IL-18 was analyzed for
biological activity and physicochemical properties as indicated below: Culturing mouse spleen cells, collected
in a conventional manner, under different concentrations of the mouse IL-18 resulted in an IFN-γ production
depending on the concentration of the mouse IL-18, and this revealed that the mouse IL-18 has an activity of inducing
IFN-γ production by spleen cells as an immunocompetent cell. In accordance with the method reported by U. K.
Laemmli in Nature, Vol. 227, pp. 680-685 (1970), the purified mouse IL-18 was subjected to SDS-PAGE under
non-reducing conditions, resulting in a major band with an IFN-γ inducing activity at a position corresponding to
19,000±5,000 daltons. The N-terminal region of the mouse IL-18 contained the amino acid sequence of SEQ ID NO:
19, which corresponded to the N-terminal region of SEQ ID NO: 18.

With reference to Experiment 7, the biological activity of the IL-18 according to the present invention will be
described in more detail, and Experiment 8 describes the acute toxicity of the IL-18:

Experiment 7 
Biological activity 
Experiment 7-1 

Induction of GM-CSF production 

Using a heparinized syringe, blood was collected from a healthy volunteer and diluted two-fold with serum-free
RPMI 1640 medium (pH 7.4). The diluted blood was overlaid on ficoll and centrifuged, and the collected lymphocytes were
washed with RPMI 1640 medium (pH 7.4) supplemented with 10 v/v % fetal calf serum and suspended in a fresh
preparation of the same medium to give a cell density of 1 x 10⁶ cells/ml, followed by distributing the cell suspension
to a 12-well microplate at two ml/well.

Using RPMI 1640 medium (pH 7.4) supplemented with 10 v/v % fetal calf serum, an IL-18 preparation obtained
by the method in Experiment 1 was prepared into a one µg/ml solution, which was then distributed to the above
microplate at 20-200 µl/well. To the microplate was further added a fresh preparation of the same medium, supplemented
with 500 µg/ml of Concanavalin A, at 10 µl/well, followed by incubation at 37°C for 48 hours in a 5 v/v % CO₂ incubator.
After completion of the culture, supernatants in each well were sampled at 0.1 ml/well and assayed for GM-CSF
content using a conventional enzyme immunoassay. In parallel, a culture system free of IL-18 was provided as a control
and treated similarly as above. The data are in Table 1:

Table 1

IL-18* (nM)      GM-CSF yield (pg/ml)
0                510
0.7              2,150
2.8              3,050
5.6              3,950

Note: The symbol "*" means that IL-18 was added to the culture system in the presence of 2.5 µg/ml of
Concanavalin A.



The results in Table 1 indicate that lymphocytes as an immunocompetent cell produced GM-CSF depending on
the concentration of IL-18 when contacted with IL-18 in the presence of Concanavalin A as a cofactor. It was also
confirmed that all of the IL-18 preparations and functional equivalents thereof, which were obtained by the methods in
Experiments 2 to 5, induced GM-CSF production even when used alone similarly as above. An IL-18 preparation
obtained by the method in Experiment 6 was tested in accordance with Experiment 7-1 except that the human
lymphocytes used in the experiment were replaced with spleen cells prepared from a mouse in a conventional manner,
revealing that the IL-18 preparation also induced GM-CSF production.
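The dosing arithmetic of Experiment 7-1 and the dose response in Table 1 can be worked through in a short sketch. It rests on two assumptions that the text does not state: a molecular weight of 18,000 daltons for IL-18 (the lower end of the SDS-PAGE estimate) and a nominal well volume of two milliliters, ignoring the small added volumes. All names are illustrative.

```python
# Sketch: nominal concentrations and fold induction for Experiment 7-1.
# ASSUMPTIONS (not stated in the text): IL-18 molecular weight of 18,000 Da,
# and a nominal well volume of 2 ml (the added volumes are ignored).

ASSUMED_MW_DA = 18_000
WELL_VOLUME_ML = 2.0


def nominal_ng_per_ml(stock_ug_per_ml: float, added_ul: float) -> float:
    """Nominal concentration in the well; ug/ml times ul equals ng added."""
    return stock_ug_per_ml * added_ul / WELL_VOLUME_ML


def ng_per_ml_to_nM(ng_per_ml: float, mw_da: float = ASSUMED_MW_DA) -> float:
    """Convert a mass concentration (ng/ml) to a molar one (nmol/l)."""
    return ng_per_ml / mw_da * 1000.0


# Concanavalin A: 10 ul of a 500 ug/ml stock per well, expressed in ug/ml.
con_a_ug_per_ml = nominal_ng_per_ml(500.0, 10.0) / 1000.0

# Highest IL-18 dose: 200 ul of a 1 ug/ml solution per well.
top_dose_nM = ng_per_ml_to_nM(nominal_ng_per_ml(1.0, 200.0))

# Table 1: IL-18 (nM) -> GM-CSF yield (pg/ml).
TABLE_1 = {0.0: 510, 0.7: 2150, 2.8: 3050, 5.6: 3950}
fold_over_control = {nM: y / TABLE_1[0.0] for nM, y in TABLE_1.items()}

print(con_a_ug_per_ml, "ug/ml Concanavalin A")
print(round(top_dose_nM, 1), "nM IL-18 at the highest dose")
for nM, fold in fold_over_control.items():
    print(f"{nM} nM IL-18: {fold:.1f}-fold over control")
```

With these assumptions the Concanavalin A concentration matches the value in the note to Table 1, and the highest IL-18 dose lands on the top row of the table.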

Experiment 7-2 

Inhibition of osteoclast formation

Experiment 7-2(a)

As reported by T. J. Martin and K. W. Ng in Journal of Cellular Biochemistry, Vol. 56, pp. 357-366 (1994), it is
generally considered that osteoclastic precursor cells, derived from hematopoietic stem cells, must be in contact with
osteoblasts or bone marrow stromal cells to differentiate into mature osteoclasts. As described
by G. D. Roodman in Endocrine Reviews, Vol. 17, No. 4, pp. 308-332 (1996), it is generally recognized that osteoclasts
are characterized by being multinucleated cells, having tartrate-resistant acid phosphatase (hereinafter abbreviated
as "TRAP") activity, and bearing a calcitonin receptor. In a co-culture system of osteoblasts and bone marrow cells as
reported by Nobuyuki UDAGAWA et al. in Journal of Experimental Medicine, Vol. 182, pp. 1,461-1,468 (1995), these
cells respond to factors such as 1α,25-dihydroxyvitamin D₃, prostaglandin E₂, adrenocortical hormone, interleukin 1,
interleukin 6, and interleukin 11 to form osteoclast-like cells (hereinafter may be abbreviated as "OCL"). The formed
OCL has the characteristics of osteoclasts in vivo. Therefore, the co-culture system well reflects in vitro the processes
of osteoclast formation in vivo. Using this system, experiments on osteoclast formation and osteoclastgenic inhibitory
agents can be carried out.

The osteoclastgenic inhibitory activity of the IL-18 according to the present invention was studied using the above
co-culture system. The osteoblasts used in this experiment were prepared in a conventional manner by treating
newborn mouse calvaria with 0.1 w/v % collagenase commercialized by Worthington Biochemical Co., Freefold,
Australia, and 0.2 w/v % dispase commercialized by Godo Shusei Co., Ltd., Tokyo, Japan. The bone marrow cells were
prepared from a mature mouse in a conventional manner. As a negative control, 2 x 10⁴ cells of a primary cell culture
of osteoblasts and 5 x 10⁵ cells of bone marrow cells were co-cultured in each well of a 48-well microplate containing
0.4 ml/well of α-MEM medium supplemented with 10 v/v % fetal calf serum (hereinafter designated as "Medium"
throughout Experiment 7-2) at 37°C for seven days in a 5 v/v % CO₂ incubator. As a positive control, the above two
types of cells were co-cultured similarly as in the negative control except that they were cultured in other wells containing
10⁻⁸ M of 1α,25-dihydroxyvitamin D₃ commercialized by Wako Pure Chemicals, Tokyo, Japan, and 10⁻⁷ M of
prostaglandin E₂ commercialized by Sigma Chemical Company, Missouri, USA. As test systems, the two types of cells
were co-cultured similarly as in the positive control except that they were cultured in other wells containing
1α,25-dihydroxyvitamin D₃ and prostaglandin E₂ in the same concentrations as used in the positive control, together
with 0.01-10 ng/ml of an IL-18 preparation prepared by the method in Experiment 6. In every co-culture system, the
media in each well were replaced with fresh preparations of the same media on the 3rd
day after the initiation of each culture. According to the method by Nobuyuki UDAGAWA in Journal of Experimental
Medicine, Vol. 182, pp. 1,461-1,468 (1995), the cells on the 6th day after the initiation of each culture were fixed and
stained based on TRAP activity, followed by counting the stained cells (hereinafter called "TRAP-positive cells") per
well. Throughout Experiment 7-2, quadruplicate wells under the same conditions were provided for each co-culture
system, and the mean value for the TRAP-positive cells per well in each system was calculated. The results are in Table 2.
As shown in Table 2, the formation of TRAP-positive cells was not substantially observed in the negative control,
but distinct formation was observed in the positive control. In the co-culture systems consisting of the positive control
supplemented additionally with IL-18, the formation of TRAP-positive cells was inhibited depending on the
concentration of IL-18, and the maximum inhibition, i.e., a level equal to that in the negative control, was found at eight
ng/ml or more of IL-18. These data strongly indicate that IL-18 has a definite activity of inhibiting OCL formation in vitro
and also inhibits osteoclast formation.
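The readout used throughout Experiment 7-2, a mean TRAP-positive cell count over quadruplicate wells compared against the two controls, can be sketched as follows. The well counts below are hypothetical placeholders chosen only to make the arithmetic easy to follow; they are not the values of Table 2, and the function name is an assumption.

```python
# Sketch: mean TRAP-positive counts per well and percent inhibition.
# The counts are HYPOTHETICAL placeholders, not data from Table 2.
from statistics import mean


def percent_inhibition(treated, positive, negative) -> float:
    """Inhibition relative to the window between positive and negative controls."""
    window = mean(positive) - mean(negative)
    if window <= 0:
        raise ValueError("positive control must exceed negative control")
    return 100.0 * (mean(positive) - mean(treated)) / window


# Quadruplicate wells per co-culture system (hypothetical counts).
negative_control = [2, 1, 3, 2]        # no inducing factors
positive_control = [100, 98, 104, 98]  # vitamin D3 plus prostaglandin E2
with_il18 = [50, 52, 49, 53]           # positive control plus IL-18

print(mean(negative_control), mean(positive_control), mean(with_il18))
print(percent_inhibition(with_il18, positive_control, negative_control))
```

Expressing the result relative to both controls makes "maximum inhibition" correspond to 100%, i.e., a count equal to that of the negative control.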

Experiment 7-2(b) 

As described hereinbefore, it was confirmed that there exist factors that induce the formation of osteoclast-like
cells in the co-culture systems used throughout Experiment 7-2. Therefore, in this Experiment 7-2(b), it was studied
whether the inhibitory activity of IL-18 on osteoclast formation observed in Experiment 7-2(a) was specific to some
factors or not. The osteoclast-like cells were cultured by the same method as used in the negative control in Experiment
7-2(a) except for using a medium supplemented with 10⁻⁸ M 1α,25-dihydroxyvitamin D₃, 10⁻⁷ M prostaglandin E₂, 200
ng/ml parathyroid hormone, 100 ng/ml interleukin 1, or 20 ng/ml interleukin 11. These culture systems served as positive
controls. In parallel, the cells were cultured in other wells by the same method used in the positive controls except for
using a medium containing 10 ng/ml of an IL-18 preparation obtained by the method in Experiment 6, in addition to
any one of the above factors at the same concentration. After completion of the cultures, TRAP-positive cells in each
well were counted, and the numbers were compared similarly as in Experiment 7-2(a). The results are in Table 3:
As shown in Table 3, a distinct formation of TRAP-positive cells was observed in every positive control, but the
formation was almost completely inhibited in the presence of IL-18. This strongly indicates that IL-18 has a wide and
general activity of inhibiting osteoclast formation independently of osteoclast-formation-related factors.

Experiment 7-2(c)

It was studied whether the osteoclastgenic inhibition by IL-18, confirmed in Experiments 7-2(a) and 7-2(b), was
caused by the action of the IL-18-induced GM-CSF. For positive and negative controls, the same co-culture systems
employed in Experiment 7-2(a) were used. Using other wells, the co-culture of osteoblasts and bone marrow cells was
carried out similarly as the method used for the positive controls except for using a medium supplemented with
1α,25-dihydroxyvitamin D₃ and prostaglandin E₂ at the same concentrations used in the positive control, and with (i) 10
µg/ml of an anti-mouse GM-CSF polyclonal antibody commercialized by R&D Systems, Minnesota, USA, (ii) 10 ng/ml
of an IL-18 preparation obtained by the method in Experiment 6, (iii) (ii) plus 10 µg/ml of an anti-mouse GM-CSF
polyclonal antibody, (iv) 0.1 ng/ml of a mouse GM-CSF commercialized by R&D Systems, Minnesota, USA, or (v) (iv)
plus 10 µg/ml of an anti-mouse GM-CSF polyclonal antibody. After completion of the culture, TRAP-positive cells in each
well were counted, and the numbers were compared similarly as in Experiment 7-2(a). The data are shown in Table 4,
where the symbols "i" to "v" coincide with those used in the co-culture systems other than the control systems.
As shown in Table 4, the formation of TRAP-positive cells was almost completely inhibited by IL-18, cf., the
co-culture system (ii), but the inhibition was almost completely abolished by the addition of the anti-mouse GM-CSF
polyclonal antibody, cf., the co-culture system (iii). Mouse GM-CSF exhibited an activity of inhibiting the formation of
TRAP-positive cells similar to IL-18, cf., the co-culture system (iv), and the inhibition was almost completely abolished
by the addition of the anti-mouse GM-CSF polyclonal antibody, cf., the co-culture system (v). The sole use of the
anti-mouse GM-CSF polyclonal antibody had no influence on the formation of TRAP-positive cells, cf., the co-culture
system (i). These data strongly indicate that the osteoclastgenic inhibition by IL-18 was due to the action of the
IL-18-induced GM-CSF.

Experiment 8 

Acute toxicity test 

Eight-week-old mice were in a conventional manner administered percutaneously, orally, or intraperitoneally with
any one of the IL-18 preparations obtained by the methods in Experiments 1 to 6. The results showed that these IL-18
preparations had an LD₅₀ of about one mg/kg or more in mice independently of the route of administration. The data
show that IL-18 can be incorporated into pharmaceuticals for warm-blooded animals in general, including humans,
without causing serious side effects.

As described in Nikkei Biotechnology Annual Report 1996, pp. 498-499, published by Nikkei BP Publisher,
Tokyo, Japan (1995), GM-CSF, the production of which is induced by IL-18, has not yet been clinically used in Japan
but has been applied clinically in the USA and Europe. This fact would show that IL-18 has substantially no serious
side effects. These facts indicate that the osteoclastgenic inhibitory agent according to the present invention can be
successively administered to warm-blooded animals in general, including humans, to inhibit osteoclast formation and
exert a satisfactory therapeutic and/or prophylactic effect on osteoclast-related diseases without causing serious side
effects.

The following Examples describe the osteoclastgenic inhibitory agent according to the present invention:

Example 1 
Liquid 

Any one of the IL-18 preparations obtained by the methods in Experiments 1 to 6 was dissolved in physiological
saline containing one w/v % human serum albumin as a stabilizer to give a concentration of two mg/ml of the IL-18
preparation. The resulting solutions were in a conventional manner membrane filtered for sterilization into liquids.

The liquids have a satisfactory stability and can be arbitrarily used as ingredients for cell culture and as agents in the
form of an injection, ophthalmic solution, or collunarium for regulating bone resorption and for osteoclast-related
diseases, directed to treat and/or prevent hypercalcemia, osteoclastoma, osteoporosis, etc.

Example 2 
Dry agent 

Fifty milligrams of any one of the IL-18 preparations obtained by the methods in Experiments 1 to 6 was dissolved in
100 ml of physiological saline containing one w/v % purified gelatin as a stabilizer. The solutions thus obtained were
in a conventional manner membrane filtered for sterilization, distributed to vials in one-milliliter portions, lyophilized,
and sealed with caps.

The products have a satisfactory stability and can be arbitrarily used as ingredients for cell culture and as agents in
the form of a dry injection for regulating bone resorption and for osteoclast-related diseases, directed to treat and/or
prevent hypercalcemia, osteoclastoma, osteoporosis, etc.
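The contents of each vial follow directly from the quantities given in Example 2. The sketch below works them out; the helper names are illustrative, and it relies only on the usual convention that one w/v % equals one gram per 100 ml.

```python
# Sketch: per-vial contents of the lyophilized preparation in Example 2.
# Names are illustrative; 1 w/v % is taken as 1 g per 100 ml (10 mg/ml).


def mg_per_vial(total_mg: float, batch_volume_ml: float, fill_ml: float) -> float:
    """Mass of active ingredient dispensed into one vial."""
    return total_mg / batch_volume_ml * fill_ml


def stabilizer_mg_per_ml(w_v_percent: float) -> float:
    """Convert a w/v percentage to mg/ml."""
    return w_v_percent * 10.0


BATCH_ML = 100.0  # physiological saline used to dissolve the preparation
FILL_ML = 1.0     # volume distributed to each vial

il18_mg = mg_per_vial(50.0, BATCH_ML, FILL_ML)
gelatin_mg = stabilizer_mg_per_ml(1.0) * FILL_ML
vials_per_batch = int(BATCH_ML // FILL_ML)

print(il18_mg, "mg IL-18 preparation per vial")
print(gelatin_mg, "mg stabilizer per vial")
print(vials_per_batch, "vials per batch (ignoring filtration losses)")
```

The same arithmetic applies to Example 3, which differs only in the stabilizer used.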

Example 3 

Dry agent

Fifty milligrams of any one of the IL-18 preparations obtained by the methods in Experiments 1 to 6 was dissolved in
100 ml of physiological saline containing one w/v % trehalose as a stabilizer. The solutions were in a conventional
manner membrane filtered for sterilization, distributed to vials in one-milliliter portions, lyophilized, and sealed with caps.

The products have a satisfactory stability and can be arbitrarily used as ingredients for cell culture and as agents in
the form of a dry injection for regulating bone resorption and for osteoclast-related diseases, directed to treat and/or
prevent hypercalcemia, osteoclastoma, osteoporosis, etc.
Example 4
Ointment

"HI VIS WAKO GEL® 104", a carboxyvinyl polymer commercialized by Wako Pure Chemical Industries, Ltd., Tokyo,
Japan, and a high-purity trehalose were dissolved in sterilized distilled water to give respective concentrations of 1.4
w/w % and 2.0 w/w %, and the solution was mixed to homogeneity with any one of the IL-18 preparations obtained by the
methods in Experiments 1 to 6 and adjusted to pH 7.2 to obtain a paste containing about one mg of an IL-18 preparation
per g of the product.

Each product thus obtained has a satisfactory spreadability and stability and can be arbitrarily used as an agent
in the form of an ointment for regulating bone resorption and for osteoclast-related diseases, directed to treat and/or
prevent hypercalcemia, osteoclastoma, osteoporosis, etc.

Example 5 

Tablet

"FINETOSE®", an anhydrous crystalline α-maltose powder commercialized by Hayashibara Biochemical
Laboratories, Inc., Okayama, Japan, was mixed to homogeneity with any one of the IL-18 preparations obtained by the
methods in Experiments 1 to 6 and "LUMIN" or 1-1'-1"-trihepthyl-11-chinolyl(4)·4·4'-penthamethinchynocyanine-1,1"-dijodide.
The mixtures were in a conventional manner tabletted to obtain tablets, about 200 mg each, containing about
two milligrams of the IL-18 preparation and about two milligrams of LUMIN per tablet.

The products have a satisfactory swallowability, stability, and cell-activating activity and can be arbitrarily used as
agents in the form of a tablet for regulating bone resorption and for osteoclast-related diseases, directed to treat and/or
prevent hypercalcemia, osteoclastoma, osteoporosis, etc.

As described above, the osteoclastgenic inhibitory agent according to the present invention effectively inhibits
osteoclast formation. Therefore, the agent can be arbitrarily used as an ingredient for cell culture and as an agent for
regulating bone resorption and for osteoclast-related diseases, directed to treat and/or prevent hypercalcemia,
osteoclastoma, osteoporosis, etc.

Thus, the present invention with these useful activities and functions is a significant invention that would greatly
contribute to this field.

While there has been described what is at present considered to be the preferred embodiments of the invention,
it will be understood that various modifications may be made therein, and it is intended to cover in the appended claims
all such modifications as fall within the true spirit and scope of the invention.
Annex to the description



SEQUENCE LISTING 

(1) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Asn Asp Gln Val Leu Phe
1               5

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Phe Glu Asp Met Thr Asp
1               5

(3) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Phe Lys Leu Ile Leu Lys Lys
1               5

(4) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Tyr Lys Asp Ser
1               5

(5) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ser Thr Leu Ser Cys
1               5

(6) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30
Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60
Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65                  70                  75                  80
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(7) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn
1               5                   10                  15
Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
            20                  25                  30
Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
        35                  40                  45
Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
    50                  55                  60
Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
65                  70                  75                  80
Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                85                  90                  95
Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110
Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Cys Gln Lys Glu
        115                 120                 125
Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140
Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155

(8) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: human
(G) CELL TYPE: liver
(ix) FEATURE:
(A) NAME/KEY: mat peptide
(B) LOCATION: 1..471
(C) IDENTIFICATION METHOD: E
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT    48
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15
GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT    96
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30
ATG ACT GAT TCT GAC TGT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT    144
Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45
... ... ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC    192
... ... Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60
... ... AAG TGT GAG AAA ATT TCA ACT CTC TCC ... GAG AAC AAA ATT    240
... ... Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65                  70                  75                  80
... ... TTT AAG GAA ATG AAT CCT CCT GAT ... ATC AAG GAT ACA AAA    288
... ... Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95
... ... ATC ATA ... TTT ... AGA AGT GTC CCA GGA CAT GAT AAT AAG    336
... ... Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110
... ... TTT GAA TCT TCA TCA TAC GAA GGA TAC TTT CTA GCT TGT GAA    384
... ... Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125
... ... AGA GAC CTT TTT AAA CTC ATT TTG AAA AAA GAG GAT GAA TTG    432
... ... Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140
GGG GAT AGA TCT ATA ATG TTC ACT GTT CAA AAC GAA GAC                471
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(9) INFORMATION FOR SEQ ID NO: 9: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 11 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

( ii ) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: N- terminal fragment 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser 
1 5 10 

(10) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: C-terminal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Ser Ile Met Phe Thr Val Gln Asn Glu Asp
1 5 10



192 



240 



288 



336 



384 



432 



471



(11) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: N-terminal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg
1 5 10

(12) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Thr Ile Phe Ile Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg
1 5 10

(13) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ile Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
1 5 10 15

(14) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



EP 0 861 663 A2 



(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: mat peptide

(B) LOCATION: 1..471

(C) IDENTIFICATION METHOD: S

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:



TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1 5 10 15

GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
20 25 30

ATG ACT GAT TCT GAC TCT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT
Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
35 40 45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
50 55 60

TCT GTG AAG TCT GAG AAA ATT TCA ACT CTC TCC GCT GAG AAC AAA ATT
Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
65 70 75 80

ATT TCC TTT AAG GAA ATG AAT CCT CCT GAT AAC ATC AAG GAT ACA AAA
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
85 90 95

AGT GAC ATC ATA TTC TTT CAG AGA AGT GTC CCA GGA CAT GAT AAT AAG
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
100 105 110

ATG CAA TTT GAA TCT TCA TCA TAC GAA GGA TAC TTT CTA GCT TGT GAA
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
115 120 125

AAA GAG AGA GAC CTT TTT AAA CTC ATT TTG AAA AAA GAG GAT GAA TTG
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
130 135 140

GGG GAT AGA TCT ATA ATG TTC ACT GTT CAA AAC GAA GAC
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145 150 155











(15) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: N-terminal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser
1 5 10

(16) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: mat peptide

(B) LOCATION: 1..471

(C) IDENTIFICATION METHOD: S

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:



TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT 48
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1 5 10 15

GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT 96
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
20 25 30

ATG ACT GAT TCT GAC TCT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT 144
Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
35 40 45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC 192
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
50 55 60

TCT GTG AAG TCT GAG AAA ATT TCA ACT CTC TCC GCT GAG AAC AAA ATT 240
Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
65 70 75 80

ATT TCC TTT AAG GAA ATG AAT CCT CCT GAT AAC ATC AAG GAT ACA AAA 288
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
85 90 95

AGT GAC ATC ATA TTC TTT CAG AGA AGT GTC CCA GGA CAT GAT AAT AAG 336
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
100 105 110

ATG CAA TTT GAA TCT TCA TCA TAC GAA GGA TAC TTT CTA GCT TCT GAA 384
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
115 120 125

AAA GAG AGA GAC CTT TTT AAA CTC ATT TTG AAA AAA GAG GAT GAA TTG 432
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
130 135 140

GGG GAT AGA TCT ATA ATG TTC ACT GTT CAA AAC GAA GAC 471
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145 150 155














(17) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11464 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: human
(G) CELL TYPE: placenta

(ix) FEATURE:

(A) NAME/KEY: 5' UTR
(B) LOCATION: 1..3
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: leader peptide
(B) LOCATION: 4..82
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 83..1453
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: leader peptide
(B) LOCATION: 1454..1465
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 1466..4848
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: leader peptide
(B) LOCATION: 4849..4865
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: mat peptide
(B) LOCATION: 4866..4983
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 4984..6317
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: mat peptide
(B) LOCATION: 6318..6451
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 6452..11224
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: mat peptide
(B) LOCATION: 11225..11443
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: 3' UTR
(B) LOCATION: 11444..11464
(C) IDENTIFICATION METHOD: E
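The feature locations for SEQ ID NO: 17 can be cross-checked arithmetically. The following is a minimal Python sketch, assuming the locations are 1-based and inclusive at both ends; the `FEATURES` list and the helper names are illustrative and not part of the original listing.

```python
# Feature locations copied from the SEQ ID NO: 17 feature table.
# Assumption: coordinates are 1-based and inclusive at both ends.
FEATURES = [
    ("5' UTR", 1, 3),
    ("leader peptide", 4, 82),
    ("intron", 83, 1453),
    ("leader peptide", 1454, 1465),
    ("intron", 1466, 4848),
    ("leader peptide", 4849, 4865),
    ("mat peptide", 4866, 4983),
    ("intron", 4984, 6317),
    ("mat peptide", 6318, 6451),
    ("intron", 6452, 11224),
    ("mat peptide", 11225, 11443),
    ("3' UTR", 11444, 11464),
]


def segment_length(start, end):
    """Number of bases in an inclusive 1-based interval."""
    return end - start + 1


def total_length(features, key):
    """Sum the lengths of all segments whose feature key equals `key`."""
    return sum(segment_length(s, e) for name, s, e in features if name == key)


def is_contiguous(features):
    """True when every segment starts one base after the previous one ends."""
    return all(nxt[1] == prev[2] + 1 for prev, nxt in zip(features, features[1:]))


if __name__ == "__main__":
    print("mature peptide bases:", total_length(FEATURES, "mat peptide"))
    print("leader peptide bases:", total_length(FEATURES, "leader peptide"))
    print("segments contiguous: ", is_contiguous(FEATURES))
```

Under these assumptions the three mature-peptide segments total 471 bases (157 codons), which matches the LENGTH given for the 471-base-pair cDNA listings, and the three leader-peptide segments total 108 bases (36 codons).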

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

AAG ATG GCT GCT GAA CCA GTA GAA GAC AAT TGC ATC AAC TTT GTG GCA 48
Met Ala Ala Glu Pro Val Glu Asp Asn Cys Ile Asn Phe Val Ala
-35 -30 -25

ATG AAA TTT ATT GAC AAT ACG CTT TAC TTT ATA G GTAAGG CTAATGCCAT 98
Met Lys Phe Ile Asp Asn Thr Leu Tyr Phe Ile Ala
-20 -15 -10

AGAACAAATA CCAGGTTCAG ATAAATCTAT TCAATTAGAA AAGATGTTGT GAGGTGAACT 158
ATTAAGTGAC TCTTTGTGTC ACCAAATTTC ACTGTAATAT TAATGGCTCT TAAAAAAATA 218
GTGGACCTCT AGAAATTAAC CACAACATGT CCAAGGTCTC AGCACCTTGT CACACCACGT 278
GTCCTGGCAC TTTAATCAGC AGTAGCTCAC TCTCCAGTTG GCAGTAAGTG CACATCATGA 338
AAATCCCAGT TTTCATGGGA AAATCCCAGT TTTCATTGGA TTTCCATGGG AAAAATCCCA 398
GTACAAAACT GGGTGCATTC AGGAAATACA ATTTCCCAAA GCAAATTGGC AAATTATGTA 458
AGAGATTCTC TAAATTTAGA GTTCCGTGAA TTACACCATT TTATGTAAAT ATGTTTGACA 518
AGTAAAAATT GATTCTTTTT TTTTTTTTCT GTTGCCCAGG CTGGAGTGCA GTGGCACAAT 578
CTCTGCTCAC TGCAACCTCC ACCTCCTGGG TTCAAGCAAT TCTCCTGCCT CAGCCTTCTG 638
AGTAGCTGGG ACTACAGGTG CATCCCGCCA TGCCTGGCTA ATTTTTGGGT ATTTTTACTA 698
GAGACAGGGT TTTGGCATGT TGTCCAGGCT GGTCTTGGAC TCCTGATCTC AGATGATCCT 758
CCTGGCTCGG GCTCCCAAAG TGCTGGGATT ACAGGCATGA ACCACCACAC ATGGCCTAAA 818
AATTGATTCT TATGATTAAT CTCCTGTGAA CAATTTGGCT TCATTTGAAA GTTTGCCTTC 878
ATTTGAAACC TTCATTTAAA AGCCTGAGCA ACAAAGTGAG ACCCCATCTC TACAAAAAAC 938
TGCAAAATAT CCTGTGGACA CCTCCTACCT TCTGTGGAGG CTGAAGCAGG AGGATCACTT 998
GAGCCTAGGA ATTTGAGCCT GCAGTGAGCT ATGATCCCAC CCCTACACTC CAGCCTGCAT 1058
GACAGTAGAC CCTGACACAC ACACACAAAA AAAAACCTTC ATAAAAAATT ATTAGTTGAC 1118
TTTTCTTAGG TGACTTTCCG TTTAAGCAAT AAATTTAAAA GTAAAATCTC TAATTTTAGA 1178
AAATTTATTT TTAGTTACAT ATTGAAATTT TTAAACCCTA GGTTTAAGTT TTATGTCTAA 1238
ATTACCTGAG AACACACTAA GTCTGATAAG CTTCATTTTA TGGGCCTTTT GGATGATTAT 1298
ATAATATTCT GATGAAAGCC AAGACAGACC CTTAAACCAT AAAAATAGGA GTTCGAGAAA 1358
GAGGAGTAGC AAAAGTAAAA GCTAGAATGA GATTGAATTC TGAGTCGAAA TACAAAATTT 1418

TACATATTCT GTTTCTCTCT TTTTCCCCCT CTTAG CT GAA GAT GAT G GTAAA 1470
Ala Glu Asp Asp Glu
-10

GTAGAAATGA ATTTATTTTT CTTTGCAAAC TAAGTATCTG CTTGAGACAC ATCTATCTCA 1530
CCATTGTCAG CTGAGGAAAA AAAAAAATGG TTCTCATGCT ACCAATCTGC CTTCAAAGAA 1590
ATGTGGACTC AGTAGCACAG CTTTGGAATG AAGATGATCA TAAGAGATAC AAAGAAGAAC 1650
CTCTAGCAAA AGATGCTTCT CTATGCCTTA AAAAATTCTC CAGCTCTTAG AATCTACAAA 1710
ATAGACTTTG CCTGTTTCAT TGGTCCTAAG ATTAGCATGA AGCCATGGAT TCTGTTGTAG 1770
GGGGAGCGTT GCATAGGAAA AAGGGATTGA AGCATTAGAA TTGTCCAAAA TCAGTAACAC 1830
CTCCTCTCAG AAATGCTTTG GGAAGAAGCC TGGAAGGTTC CGGGTTGGTG GTGGGGTGGG 1890
GCAGAAAATT CTGGAAGTAG AGGAGATAGG AATGGGTGGG GCAAGAAGAC CACATTCAGA 1950
GGCCAAAAGC TGAAAGAAAC CATGGCATTT ATGATGAATT CAGGGTAATT CAGAATGGAA 2010
GTAGAGTAGG AGTAGGAGAC TGGTGAGAGG AGCTAGAGTG ATAAACAGGG TGTAGAGCAA 2070
GACGTTCTCT CACCCCAAGA TGTGAAATTT GGACTTTATC TTGGAGATAA TAGGGTTAAT 2130
TAAGCACAAT ATGTATTAGC TAGGGTAAAG ATTAGTTTGT TGTAACAAAG ACATCCAAAG 2190
ATACAGTAGC TGAATAAGAT AGAGAATTTT TCTCTCAAAG AAAGTCTAAG TAGGCAGCTC 2250
AGAAGTAGTA TGGCTGGAAG CAACCTGATG ATATTGGGAC CCCCAACCTT CTTCAGTCTT 2310
GTACCCATCA TCCCCTAGTT GTTGATCTCA CTCACATAGT TGAAAATCAT CATACTTCCT 2370
GGGTTCATAT CCCAGTTATC AAGAAAGGGT CAAGAGAAGT CAGGCTCATT CCTTTCAAAG 2430
ACTCTAATTG GAAGTTAAAC ACATCAATCC CCCTCATATT CCATTGACTA GAATTTAATC 2490
ACATGGCCAC ACCAAGTGCA AGGAAATCTG GAAAATATAA TCTTTATTCC AGGTAGCCAT 2550
ATGACTCTTT AAAATTCAGA AATAATATAT TTTTAAAATA TCATTCTGGC TTTGGTATAA 2610
AGAATTGATG GTGTGGGGTG AGGAGGCCAA AATTAAGGGT TGAGAGCCTA TTATTTTAGT 2670
TATTACAAGA AATGATGGTG TCATGAATTA AGGTAGACAT AGGGGAGTGC TGATGAGGAG 2730
CTGTGAATGG ATTTTAGAAA CACTTGAGAG AATCAATAGG ACATGATTTA GGGTTGGATT 2790
TGGAAAGGAG AAGAAAGTAG AAAAGATGAT GCCTACATTT TTCACTTAGG CAATTTGTAC 2850
CATTCAGTGA AATAGGGAAC ACAGGAGGAA GAGCAGGTTT TGGTGTATAC AAAGAGGAGG 2910
ATGGATGACG CATTTCGTTT TGGATCTGAG ATGTCTGTGG AACGTCCTAG TGGAGATGTC 2970
CACAAACTCT TCTACATGTG GTTCTGAGTT CAGGACACAG ATTTGGGCTG GAGATAGAGA 3030
TATTGTAGGC TTATACATAG AAATGGCATT TGAATCTATA GAGATAAAAA GACACATCAG 3090
AGGAAATGTG TAAAGTGAGA GAGGAAAAGC CAAGTACTGT GCTGGGGGGA ATACCTACAT 3150
TTAAAGGATG CAGTAGAAAG AAGCTAATAA ACAACAGAGA GCAGACTAAC CAAAAGGGGA 3210
GAAGAAAAAC CAAGAGAATT CCACCGACTC CCAGGAGAGC ATTTCAAGAT TGAGGGGATA 3270
GGTGTTGTGT TGAATTTTGC AGCCTTGAGA ATCAAGGGCC AGAACACAGC TTTTAGATTT 3330
AGCAACAAGG AGTTTGGTGA TCTCAGTGAA AGCAGCTTGA TGGTGAAATG GAGGCAGAGG 3390
CAGATTGCAA TGAGTGAAAC AGTGAATGGG AAGTGAAGAA ATGATACAGA TAATTCTTGC 3450
TAAAAGCTTG GCTGTTAAAA GGAGGAGAGA AACAAGACTA GCTGCAAAGT GAGATTGGGT 3510
TGATGGAGCA GTTTTAAATC TCAAAATAAA GAGCTTTGTG CTTTTTTGAT TATGAAAATA 3570
ATGTGTTAAT TGTAACTAAT TGAGGCAATG AAAAAAGATA ATAATATGAA AGATAAAAAT 3630
ATAAAAACCA CCCAGAAATA ATGATAGCTA CCATTTTGAT ACAATATTTC TACACTCCTT 3690
TCTATGTATA TATACAGACA CAGAAATGCT TATATTTTTA TTAAAAGGGA TTGTACTATA 3750
CCTAAGCTGC TTTTTCTAGT TAGTGATATA TATGGACATC TCTCCATGGC AACGAGTAAT 3810
TGCAGTTATA TTAAGTTCAT GATATTTCAC AATAAGGGCA TATCTTTGCC CTTTTTATTT 3870
AATCAATTCT TAATTGGTGA ATGTTTGTTT CCAGTTTGTT GTTGTTATTA ACAATGTTCC 3930
CATAAGCATT CCTGTACACC AATGTTCACA CATTTGTCTG ATTTTTTCTT CAGGATAAAA 3990
CCCAGGAGGT AGAATTGCTG GGTTGATAGA AGAGAAAGGA TGATTGCCAA ATTAAAGCTT 4050
CAGTAGAGGG TACATGCCGA GCACAAATGG GATCAGCCCT AGATACCAGA AATGGCACTT 4110
TCTCATTTCC CCTTGGGACA AAAGGGAGAG AGGCAATAAC TGTGCTGCCA GAGTTAAATT 4170
TGTACGTGGA GTAGCAGGAA ATCATTTGCT GAAAATGAAA ACAGAGATGA TGTTGTAGAG 4230
GTCCTGAAGA GAGCAAAGAA AATTTGAAAT TGCGGCTATC AGCTATGGAA GAGAGTGCTG 4290
AACTGGAAAA CAAAAGAAGT ATTGACAATT GGTATGCTTG TAATGGCACC GATTTGAACG 4350
CTTGTGCCAT TGTTCACCAG CAGCACTCAG CAGCCAAGTT TGGAGTTTTG TAGCAGAAAG 4410
ACAAATAAGT TAGGGATTTA ATATCCTGGC CAAATGGTAG ACAAAATGAA CTCTGAGATC 4470
CAGCTGCACA GGGAAGGAAG GGAAGACGGG AAGAGGTTAG ATAGGAAATA CAAGAGTCAG 4530
GAGACTGGAA GATGTTGTGA TATTTAAGAA CACATAGAGT TGGAGTAAAA GTGTAAGAAA 4590
ACTAGAAGGG TAAGAGACCG GTCAGAAAGT AGGCTATTTG AAGTTAACAC TTCAGAGGCA 4650
GAGTAGTTCT GAATGGTAAC AAGAAATTGA GTGTGCCTTT GAGAGTAGGT TAAAAAACAA 4710
TAGGCAACTT TATTGTAGCT ACTTCTGGAA CAGAAGATTG TCATTAATAG TTTTAGAAAA 4770
CTAAAATATA TAGCATACTT ATTTGTCAAT TAACAAAGAA ACTATGTATT TTTAAATGAG 4830

ATTTAATGTT TATTGTAG AA AAC CTG GAA TCA GAT TAC TTT GGC AAG CTT 4880
Glu Asn Leu Glu Ser Asp Tyr Phe Gly Lys Leu
-5 1 5

GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT GAC CAA GTT CTC TTC 4928
Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn Asp Gln Val Leu Phe
10 15 20

ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT ATG ACT GAT TCT GAC 4976
Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp Met Thr Asp Ser Asp
25 30 35

TGT AGA G GTATTTTTT TTAATTCGCA AACATAGAAA TGACTAGCTA CTTCTTCCCA 5032
Cys Arg Asp
40

TTCTGTTTTA CTGCTTACAT TGTTCCGTGC TAGTCCCAAT CCTCAGATGA AAAGTCACAG 5092
GAGTGACAAT AATTTCACTT ACAGGAAACT TTATAAGGCA TCCACGTTTT TTAGTTGGGG 5152
TAAAAAATTG GATACAATAA GACATTGCTA GGGGTCATGC CTCTCTGAGC CTGCCTTTGA 5212
ATCACCAATC CCTTTATTGT GATTGCATTA ACTGTTTAAA                       5272
AATCCCTGCT TGTTACAGCT GAAAATGCTG ATAGTTTACC AGGTGTGGTG GCATCTATCT 5332
GTAATCCTAG CTACTTGGGA GGCTCAAGCA GGAGGATTGC TTGAGGCCAG GACTTTGAGG 5392
CTGTAGTACA CTGTGATCGT ACCTGTGAAT AGCCACTGCA CTCCAGCCTG GGTGATATAC 5452
AGACCTTGTC TCTAAAATTA AAAAAAAAAA AAAAAAAAAC CTTAGGAAAG GAAATTGATC 5512
AAGTCTACTG TGCCTTCCAA AACATGAATT CCAAATATCA AAGTTAGGCT GAGTTGAAGC 5572
AGTGAATGTG CATTCTTTAA AAATACTGAA TACTTACCTT AACATATATT TTAAATATTT 5632
TATTTAGCAT TTAAAAGTTA AAAACAATCT TTTAGAATTC ATATCTTTAA AATACTCAAA 5692
AAAGTTGCAG CGTGTGTGTT GTAATACACA TTAAACTGTG GGGTTGTTTG TTTGTTTGAG 5752
ATGCAGTTTC ACTCTGTCAC CCAGGCTGAA GTGCAGTGCA GTGCAGTGGT GTGATCTCGG 5812
CTCACTACAA CCTCCACCTC CCACGTTCAA GCGATTCTCA TGCCTCAGTC TCCCGAGTAG 5872
GTGGGATTAC AGGCATGCAC CACTTACACC CGGCTAATTT TTGTATTTTT AGTAGAGCTG 5932
GGGTTTCACC ATGTTGGCCA GGCTGGTCTC AAACCCCTAA CCTCAAGTGA TCTGCCTGCC 5992
TCAGCCTCCC AAACAAACAA ACAACCCCAC AGTTTAATAT GTGTTACAAC ACACATGCTG 6052
CAACTTTTAT GAGTATTTTA ATGATATAGA TTATAAAAGG TTGTTTTTAA CTTTTAAATG 6112
CTGGGATTAC AGGCATGAGC CACTGTGCCA GGCCTGAACT GTGTTTTTAA AAATGTCTGA 6172
CCAGCTGTAC ATAGTCTCCT GCAGACTGGC CAAGTCTCAA AGTGGGAACA GGTGTATTAA 6232
GGACTATCCT TTGGTTAAAT TTCCGCAAAT GTTCCTGTGC AAGAATTCTT CTAACTAGAG 6292

TTCTCATTTA TTATATTTAT TTCAG AT AAT GCA CCC CGG ACC ATA TTT ATT 6343
Asp Asn Ala Pro Arg Thr Ile Phe Ile
40 45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC 6391
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
50 55 60

TCT GTG AAG TGT GAG AAA ATT TCA ACT CTC TCC TGT GAG AAC AAA ATT 6439
Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65 70 75 80

ATT TCC TTT AAG GTAAG ACTGAGCCTT ACTTTGTTTT CAATCATGTT AATATAATCA 6496
Ile Ser Phe Lys

ATATAATTAG AAATATAACA TTATTTCTAA TGTTAATATA AGTAATGTAA TTAGAAAACT 6556
CAAATATCCT CAGACCAACC TTTTGTCTAG AACAGAAATA ACAAGAAGCA GAGAACCATT 6616
AAAGTGAATA CTTACTAAAA ATTATCAAAC TCTTTACCTA TTGTGATAAT GATGGTTTTT 6676
CTGAGCCTGT CACAGGGGAA GAGGAGATAC AACACTTGTT TTATGACCTG CATCTCCTGA 6736
ACAATCAGTC TTTATACAAA TAATAATGTA GAATACATAT GTGAGTTATA CATTTAAGAA 6796
TAACATGTGA CTTTCCAGAA TGAGTTCTGC TATGAAGAAT GAAGCTAATT ATCCTTCTAT 6856
ATTTCTACAC CTTTGTAAAT TATGATAATA TTTTAATCCC TAGTTGTTTT GTTGCTGATC 6916
CTTAGCCTAA GTCTTAGACA CAAGCTTGAG CTTCCAGTTG ATGTATGTTA TTTTTAATGT 6976
TAATCTAATT GAATAAAAGT TATGAGATCA GCTGTAAAAG TAATGCTATA ATTATCTTCA 7036
AGCCAGGTAT AAAGTATTTC TGGCCTCTAC TTTTTCTCTA TTATTCTCCA TTATTATTCT 7096
CTATTATTTT TCTCTATTTC CTCCATTATT GTTAGATAAA CCACAATTAA CTATAGCTAC 7156
AGACTGAGCC AGTAAGAGTA GCCAGGGATG CTTACAAATT GGCAATGCTT CAGAGGAGAA 7216
TTCCATGTCA TGAAGACTCT TTTTGAGTGG AGATTTGCCA ATAAATATCC GCTTTCATGC 7276
CCACCCAGTC CCCACTGAAA GACAGTTAGG ATATGACCTT AGTGAAGGTA CCAAGGGGCA 7336
ACTTGGTAGG GAGAAAAAAG CCACTCTAAA ATATAATCCA AGTAAGAACA GTGCATATGC 7396
AACAGATACA GCCCCCAGAC AAATCCCTCA GCTATCTCCC TCCAACCAGA GTGCCACCCC 7456
TTCAGGTGAC AATTTGGAGT CCCCATTCTA GACCTGACAG GCAGCTTAGT TATCAAAATA 7516
GCATAAGAGG CCTGGGATGG AAGGGTAGGG TGGAAAGGGT TAAGCATGCT GTTACTGAAC 7576
AACATAATTA GAAGGGAAGG AGATGGCCAA GCTCAAGCTA TGTGGGATAG AGGAAAACTC 7636
AGCTGCAGAG GCAGATTCAG AAACTGGGAT AAGTCCGAAC CTACAGGTGG ATTCTTGTTG 7696
AGGGAGACTG GTGAAAATGT TAAGAAGATG GAAATAATGC TTGGCACTTA GTAGGAACTG 7756
GGCAAATCCA TATTTGGGGG AGCCTGAAGT TTATTCAATT TTGATGGCCC TTTTAAATAA 7816
AAAGAATGTG GCTGGGCGTG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGGCCGAG 7876
GGGGGCGGAT CACCTGAAGT CAGGAGTTCA AGACCAGCCT GACCAACATG GAGAAACCCC 7936
ATCTCTACTA AAAATACAAA ATTAGCTGGG CGTGGTGGCA TATGCCTGTA ATCCCAGCTA 7996
CTCGGGAGGC TGAGGCAGGA GAATCTTTTG AACCCGGGAG GCAGAGGTTG CGATGAGCCT 8056
AGATCGTGCC ATTGCACTCC AGCCTGGGCA ACAAGAGCAA AACTCGGTCT CAAAAAAAAA 8116
AAAAAAAAAG TGAAATTAAC CAAAGGCATT AGCTTAATAA TTTAATACTG TTTTTAAGTA 8176
GGGCGGGGGG TGGCTGGAAG AGATCTGTGT AAATGAGGGA ATCTGACATT TAAGCTTCAT 8236
CAGCATCATA GCAAATCTGC TTCTGGAAGG AACTCAATAA ATATTAGTTG GhGGGGGGGK 8296
GAGAGTGAGG GGTGGACTAG GACCAGTTTT AGCCCTTGTC TTTAATCCCT TTTCCTGCCA 8356
CTAATAAGGA TCTTAGCAGT GGTTATAAAA GTGGCCTAGG TTCTAGATAA TAAGATACAA 8416
CAGGCCAGGC ACAGTGGCTC ATGCCTATAA TCCCAGCACT TTGGGAGGGC AAGGCGAGTG 8476
TCTCACTTGA GATCAGGAGT TCAAGACCAG CCTGGCCAGC ATGGCGATAC TCTGTCTCTA 8536
CTAAAAAAAA TACAAAAATT AGCCAGGCAT GGTGGCATGC ACCTGTAATC CCAGCTACTC 8596
GTGAGCCTGA GGCAGAAGAA TCGCTTGAAA CCAGGAGGTG TAGGCTGCAG TGAGCTGAGA 8656
TCGCACCACT GCACTCCAGC CTGGGCGACA GAATGAGACT TTGTCTCAAA AAAAGAAAAA 8716
GATACAACAG GCTACCCTTA TGTGCTCACC TTTCACTGTT GATTACTAGC TATAAAGTCC 8776
TATAAAGTTC TTTGGTCAAG AACCTTGACA ACACTAAGAG GGATTTGCTT TGAGAGGTTA 8836
CTGTCAGAGT CTGTTTCATA TATATACATA TACATGTATA TATGTATCTA TATCCAGGCT 8896
TGGCCAGGGT TCCCTCAGAC TTTCCAGTGC ACTTGGGAGA TGTTAGGTCA ATATCAACTT 8956
TCCCTGGATT CAGATTCAAC CCCTTCTGAT GTAAAAAAAA AAAAAAAAAA GAAAGAAATC 9016
CCTTTCCCCT TGGAGCACTC AAGTTTCACC AGGTGGGGCT TTCCAAGTTG GGGGTTCTCC 9076
AAGGTCATTG GGATTGCTTT CACATCCATT TGCTATGTAC CTTCCCTATG ATGGCTGGGA 9136
GTGGTCAACA TCAAAACTAG GAAAGCTACT GCCCAAGGAT GTCCTTACCT CTATTCTGAA 9196
ATGTGCAATA AGTGTGATTA AAGAGATTGC CTGTTCTACC TATCCACACT CTCGCTTTCA 9256
ACTGTAACTT TCTTTTTTTC TTTTTTTCTT TTTTTCTTTT TTTTTGAAAC GGAGTCTCGC 9316
TCTGTCGCCC AGGCTAGAGT GCAGTGGCAC GATCTCAGCT CACTGCAAGC TCTGCCTCCC 9376
GGGTTCACGC CATTCTCCTG CCTCACCCTC CCAAGCAGCT GGGACTACAG GCGCCTGCCA 9436
CCATGCCCAG CTAATTTTTT GTATTTTTAG TAGAGACGGG GTTTCACCGT GTTAGCCAGG 9496
ATGGTCTCGA TCTCCTGAAC TTGTGATCCG CCCGCCTCAG CCTCCCAAAG TGCTGGGATT 9556
ACAGGCGTGA GCCATCGCAC CCGGCTCAAC TGTAACTTTC TATACTGGTT CATCTTCCCC 9616
TGTAATGTTA CTAGAGCTTT TGAAGTTTTG GCTATGGATT ATTTCTCATT TATACATTAG 9676
ATTTCAGATT AGTTCCAAAT TGATGCCCAC AGCTTAGGGT CTCTTCCTAA ATTGTATATT 9736
GTAGACAGCT GCAGAAGTGG GTGCCAATAG GGGAACTAGT TTATACTTTC ATCAACTTAG 9796
GACCCACACT TGTTGATAAA GAACAAAGGT CAAGAGTTAT GACTACTGAT TCCACAACTG 9856
ATTGAGAAGT TGGAGATAAC CCCGTGACCT CTGCCATCCA GAGTCTTTCA GGCATCTTTG 9916
AAGGATGAAG AAATGCTATT TTAATTTTGG AGGTTTCTCT ATCAGTGCTT AGGATCATGG 9976
GAATCTGTGC TGCCATGAGG CCAAAATTAA GTCCAAAACA TCTACTGGTT CCAGGATTAA 10036
CATGGAAGAA CCTTAGGTGG TGCCCACATG TTCTGATCCA TCCTGCAAAA TAGACATGCT 10096
GCACTAACAG GAAAAGTGCA GGCAGCACTA CCAGTTGGAT AACCTGCAAG ATTATAGTTT 10156
CAAGTAATCT AACCATTTCT CACAAGGCCC TATTCTGTGA CTGAAACATA CAAGAATCTG 10216
CATTTGGCCT TCTAAGGCAG GGCCCAGCCA AGGAGACCAT ATTCAGGACA GAAATTCAAG 10276
ACTACTATGG AACTGGAGTG CTTGGCAGGG AAGACAGAGT CAAGGACTGC CAACTGAGCC 10336
AATACAGCAG GCTTACACAG GAACCCAGGG CCTAGCCCTA CAACAATTAT TGGGTCTATT 10396
CACTGTAAGT TTTAATTTCA GGCTCCACTG AAAGAGTAAG CTAAGATTCC TGGCACTTTC 10456
TGTCTCTCTC ACAGTTGGCT CAGAAATGAG AACTGGTCAG GCCAGGCATG GTGGCTTACA 10516
CCTGGAATCC CAGCACTTTG GGAGGCCGAA GTGGGAGGGT CACTTGAGGC CAGGAGTTCA 10576
GGACCAGCTT AGGCAACAAA GTGAGATACC CCCTGACCCC TTCTCTACAA AAATAAATTT 10636
TAAAAATTAG CCAAATGTGG TGGTGTATAC TTACAGTCCC AGCTACTCAG GAGGCTGAGG 10696
CAGGGGGATT GCTTGAGCCC AGGAATTCAA GGCTGCAGTG AGCTATGATT TCACCACTGC 10756
ACTTCTGGCT GGGCAACAGA GCGAGACCCT GTCTCAAAGC AAAAAGAAAA AGAAACTAGA 10816
ACTAGCCTAA GTTTGTGGGA GGAGGTCATC ATCGTCTTTA GCCGTGAATG GTTATTATAG 10876
AGGACAGAAA TTGACATTAG CCCAAAAAGC TTGTGGTCTT TGCTGGAACT CTACTTAATC 10936
TTGAGCAAAT GTGGACACCA CTCAATGGGA GAGGAGAGAA GTAAGCTGTT TGATGTATAG 10996
GGGAAAACTA GAGGCCTGGA ACTGAATATG CATCCCATGA CAGGGAGAAT AGGAGATTCG 11056
GAGTTAAGAA GGAGAGGAGG TCAGTACTGC TGTTCAGAGA TTTTTTTTAT GTAACTCTTG 11116
AGAAGCAAAA CTACTTTTGT TCTGTTTGGT AATATACTTC AAAACAAACT TCATATATTC 11176

AAATTGTTCA TGTCCTGAAA TAATTAGGTA ATGTTTTTTT CTCTATAG GAA ATG AAT 11233
Glu Met Asn
85

CCT CCT GAT AAC ATC AAG GAT ACA AAA AGT GAC ATC ATA TTC TTT CAG 11281
Pro Pro Asp Asn Ile Lys Asp Thr Lys Ser Asp Ile Ile Phe Phe Gln
90 95 100

AGA AGT GTC CCA GGA CAT GAT AAT AAG ATG CAA TTT GAA TCT TCA TCA 11329
Arg Ser Val Pro Gly His Asp Asn Lys Met Gln Phe Glu Ser Ser Ser
105 110 115

TAC GAA GGA TAC TTT CTA GCT TGT GAA AAA GAG AGA GAC CTT TTT AAA 11377
Tyr Glu Gly Tyr Phe Leu Ala Cys Glu Lys Glu Arg Asp Leu Phe Lys
120 125 130 135

CTC ATT TTG AAA AAA GAG GAT GAA TTG GGG GAT AGA TCT ATA ATG TTC 11425
Leu Ile Leu Lys Lys Glu Asp Glu Leu Gly Asp Arg Ser Ile Met Phe
140 145 150

ACT GTT CAA AAC GAA GAC TAGCTATTAA AATTTCATGC C 11464
Thr Val Gln Asn Glu Asp
155



(18) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: mouse
(G) CELL TYPE: liver

(ix) FEATURE:

(A) NAME/KEY: mat peptide

(B) LOCATION: 1..471

(C) IDENTIFICATION METHOD: S

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:



AAC TTT GGC CGA CTT CAC TGT ACA ACC GCA GTA ATA CGG AAT ATA AAT 48
Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn
1 5 10 15

GAC CAA GTT CTC TTC GTT GAC AAA AGA CAG CCT GTG TTC GAG GAT ATG 96
Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
20 25 30

ACT GAT ATT GAT CAA AGT GCC AGT GAA CCC CAG ACC AGA CTG ATA ATA 144
Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
35 40 45

TAC ATG TAC AAA GAC AGT GAA GTA AGA GGA CTG GCT GTG ACC CTC TCT 192
Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
50 55 60

GTG AAG GAT AGT AAA ATG TCT ACC CTC TCC TGT AAG AAC AAG ATC ATT 240
Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
65 70 75 80

TCC TTT GAG GAA ATG GAT CCA CCT GAA AAT ATT GAT GAT ATA CAA AGT 288
Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
85 90 95

GAT CTC ATA TTC TTT CAG AAA CGT GTT CCA GGA CAC AAC AAG ATG GAG 336
Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
100 105 110

TTT GAA TCT TCA CTG TAT GAA GGA CAC TTT CTT GCT TGC CAA AAG GAA 384
Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Cys Gln Lys Glu
115 120 125

GAT GAT GCT TTC AAA CTC ATT CTG AAA AAA AAG GAT GAA AAT CGG GAT 432
Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
130 135 140

AAA TCT GTA ATG TTC ACT CTC ACT AAC TTA CAT CAA AGT 471
Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145 150 155
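The pairing of each codon row with the amino acid row beneath it follows the standard genetic code, so a row can be re-derived mechanically. The sketch below is illustrative only: the names `CODON_TABLE`, `THREE_LETTER` and `translate_row` are introduced here and are not part of the listing, and the standard code (no organism-specific variant) is assumed.

```python
# Standard genetic code, built from the conventional TCAG-ordered string.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_1[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue names as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}


def translate_row(row):
    """Translate a whitespace-separated row of codons into three-letter residues."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in row.upper().split()]


if __name__ == "__main__":
    # First codon row of SEQ ID NO: 18 as printed in the listing.
    first_row = "AAC TTT GGC CGA CTT CAC TGT ACA ACC GCA GTA ATA CGG AAT ATA AAT"
    print(" ".join(translate_row(first_row)))
```

Run on the first row of SEQ ID NO: 18, the sketch yields the residues Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn, the same residues printed under that row.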

(19) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: N-terminal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Asn Phe Gly Arg Leu His Cys Thr Thr
1 5

(20) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1 5 10 15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
20 25 30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
35 40 45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
50 55 60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65 70 75 80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
85 90 95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
100 105 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
115 120 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
130 135 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145 150 155
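Residue positions in a listing like SEQ ID NO: 20 can be checked programmatically, for example to find where cysteines occur or where a shorter fragment such as SEQ ID NO: 12 or SEQ ID NO: 13 sits inside the longer sequence. The sketch below is a hedged illustration: it uses only residues 1 to 144 of SEQ ID NO: 20 (the first nine rows as printed), and the helper names are introduced here for illustration.

```python
# Residues 1-144 of SEQ ID NO: 20, copied row by row from the listing
# (three-letter codes, with Ile and Gln spelled out).
SEQ20_ROWS = """
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
"""

# Internal fragments as given for SEQ ID NO: 12 and SEQ ID NO: 13.
SEQ12 = "Thr Ile Phe Ile Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg"
SEQ13 = "Ile Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys"


def residues(text):
    """Split a block of three-letter codes into a list of residues."""
    return text.split()


def positions_of(res_list, name):
    """1-based positions at which residue `name` occurs."""
    return [i for i, r in enumerate(res_list, start=1) if r == name]


def find_fragment(res_list, fragment):
    """1-based start of the first occurrence of `fragment`, or None."""
    n = len(fragment)
    for i in range(len(res_list) - n + 1):
        if res_list[i:i + n] == fragment:
            return i + 1
    return None


if __name__ == "__main__":
    seq = residues(SEQ20_ROWS)
    print("Cys positions:", positions_of(seq, "Cys"))
    print("SEQ ID NO: 12 starts at:", find_fragment(seq, residues(SEQ12)))
    print("SEQ ID NO: 13 starts at:", find_fragment(seq, residues(SEQ13)))
```

With these inputs the cysteines fall at positions 38, 76 and 127, and the two internal fragments begin at residues 45 and 80 respectively.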

(21) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1 5 10 15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asn
20 25 30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
35 40 45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
50 55 60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu [two residues illegible] Glu Asn Lys Ile
65 70 75 80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
85 90 95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
100 105 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
115 120 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
130 135 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145 150 155

(22) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1 5 10 15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
20 25 30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
35 40 45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
50 55 60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65 70 75 80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
85 90 95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
100 105 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
115 120 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
130 135 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145 150 155













(23) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser ??? Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155


(24) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ser Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala ??? Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155



(25) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp ??? Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155













(26) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(27) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Asn Phe Gly Arg Leu His Ala Thr Thr Ala Val Ile Arg Asn Ile Asn
1               5                   10                  15

Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
            20                  25                  30

Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
        35                  40                  45

Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
    50                  55                  60

Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
65                  70                  75                  80

Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                85                  90                  95

Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110

Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Cys Gln Lys Glu
        115                 120                 125

Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140

Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155











(28) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn
1               5                   10                  15

Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
            20                  25                  30

Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
        35                  40                  45

Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
    50                  55                  60

Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
65                  70                  75                  80

Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                85                  90                  95

Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110

Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Ser Gln Lys Glu
        115                 120                 125

Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140

Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155

EP 0 861 663 A2

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

NAME: KABUSHIKI KAISHA HAYASHIBARA SEIBUTSU KAGAKU KENKYUJO

(ii) TITLE OF INVENTION: OSTEOCLASTGENIC INHIBITORY AGENT
(iii) NUMBER OF SEQUENCES: 28
(iv) ADDRESS:

(A) ADDRESSEE: KABUSHIKI KAISHA HAYASHIBARA SEIBUTSU KAGAKU KENKYUJO
(B) STREET: 2-3, 1-CHOME, SHIMOISHII
(C) CITY: OKAYAMA
(E) COUNTRY: JAPAN
(F) POSTAL CODE (ZIP): 700

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(vii) PRIOR APPLICATION DATA:

(A1) APPLICATION NUMBER: JP 55,468/1997
(B1) FILING DATE: 25-FEB-1997

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Asn Asp Gln Val Leu Phe
1               5

(3) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Phe Glu Asp Met Thr Asp
1               5

(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Phe Lys Leu Ile Leu Lys Lys
1               5

(5) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Tyr Lys Asp Ser
1               5

(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ser Thr Leu Ser Cys
1               5

(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn
1               5                   10                  15

Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
            20                  25                  30

Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
        35                  40                  45

Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
    50                  55                  60

Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
65                  70                  75                  80

Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                85                  90                  95

Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110

Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Cys Gln Lys Glu
        115                 120                 125

Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140

Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155







(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: human
(G) CELL TYPE: liver

(ix) FEATURE:

(A) NAME/KEY: mat peptide
(B) LOCATION: 1..471
(C) IDENTIFICATION METHOD: E

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT   48
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT   96
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

ATG ACT GAT TCT GAC TGT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT   144
Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC   192
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

TCT GTG AAG TGT GAG AAA ATT TCA ACT CTC TCC TGT GAG AAC AAA ATT   240
Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65                  70                  75                  80

ATT TCC TTT AAG GAA ATG AAT CCT CCT GAT AAC ATC AAG GAT ACA AAA   288
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

AGT GAC ATC ATA TTC TTT CAG AGA AGT GTC CCA GGA CAT GAT AAT AAG   336
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

ATG CAA TTT GAA TCT TCA TCA TAC GAA GGA TAC TTT CTA GCT TGT GAA   384
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

AAA GAG AGA GAC CTT TTT AAA CTC ATT TTG AAA AAA GAG GAT GAA TTG   432
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

GGG GAT AGA TCT ATA ATG TTC ACT GTT CAA AAC GAA GAC   471
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155
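The pairing of each codon row with the amino acid row beneath it can be checked mechanically. The following is a minimal sketch in Python, not part of the original listing: it assumes the standard genetic code, and the names `CODON_TABLE`, `THREE_LETTER`, `translate` and `FIRST_ROW` are illustrative only. It translates the first 48 bases of SEQ ID NO: 8 into three-letter residue names.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(cdna: str) -> list[str]:
    """Translate a coding-strand DNA string (whitespace ignored) codon by codon."""
    seq = "".join(cdna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, len(seq), 3)]


# First row (bases 1 to 48) of SEQ ID NO: 8 as printed above.
FIRST_ROW = "TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT"
print(" ".join(translate(FIRST_ROW)))
```

Run on the remaining rows, the same function gives a quick way to spot a codon that disagrees with the residue printed beneath it, for example TGT (Cys) against TCT (Ser) or GCT (Ala) at the substituted positions of the variant sequences.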

(10) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser
1               5                   10

(11) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: C-terminal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Ser Ile Met Phe Thr Val Gln Asn Glu Asp
1               5                   10

(12) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg
1               5                   10

(13) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Thr Ile Phe Ile Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg
1               5                   10

(14) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ile Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
1               5                   10                  15

(15) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: mat peptide
(B) LOCATION: 1..471
(C) IDENTIFICATION METHOD: S

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT   48
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT   96
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

ATG ACT GAT TCT GAC TCT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT   144
Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC   192
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

TCT GTG AAG TCT GAG AAA ATT TCA ACT CTC TCC GCT GAG AAC AAA ATT   240
Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
65                  70                  75                  80

ATT TCC TTT AAG GAA ATG AAT CCT CCT GAT AAC ATC AAG GAT ACA AAA   288
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

AGT GAC ATC ATA TTC TTT CAG AGA AGT GTC CCA GGA CAT GAT AAT AAG   336
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

ATG CAA TTT GAA TCT TCA TCA TAC GAA GGA TAC TTT CTA GCT TGT GAA   384
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

AAA GAG AGA GAC CTT TTT AAA CTC ATT TTG AAA AAA GAG GAT GAA TTG   432
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

GGG GAT AGA TCT ATA ATG TTC ACT GTT CAA AAC GAA GAC   471
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(16) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser
1               5                   10

(17) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: mat peptide
(B) LOCATION: 1..471
(C) IDENTIFICATION METHOD: S

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT   48
Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT   96
Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

ATG ACT GAT TCT GAC TCT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT   144
Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC   192
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

TCT GTG AAG TCT GAG AAA ATT TCA ACT CTC TCC GCT GAG AAC AAA ATT   240
Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
65                  70                  75                  80

ATT TCC TTT AAG GAA ATG AAT CCT CCT GAT AAC ATC AAG GAT ACA AAA   288
Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

AGT GAC ATC ATA TTC TTT CAG AGA AGT GTC CCA GGA CAT GAT AAT AAG   336
Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

ATG CAA TTT GAA TCT TCA TCA TAC GAA GGA TAC TTT CTA GCT TCT GAA   384
Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

AAA GAG AGA GAC CTT TTT AAA CTC ATT TTG AAA AAA GAG GAT GAA TTG   432
Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

GGG GAT AGA TCT ATA ATG TTC ACT GTT CAA AAC GAA GAC   471
Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(18) INFORMATION FOR SEQ ID NO: 17:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11464 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: human
(G) CELL TYPE: placenta

(ix) FEATURE:

(A) NAME/KEY: 5' UTR
(B) LOCATION: 1..3
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: leader peptide
(B) LOCATION: 4..82
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 83..1453
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: leader peptide
(B) LOCATION: 1454..1465
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 1466..4848
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: leader peptide
(B) LOCATION: 4849..4865
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: mat peptide
(B) LOCATION: 4866..4983
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 4984..6317
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: mat peptide
(B) LOCATION: 6318..6451
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: intron
(B) LOCATION: 6452..11224
(C) IDENTIFICATION METHOD: E

(A) NAME/KEY: mat peptide
(B) LOCATION: 11225..11443
(C) IDENTIFICATION METHOD: S

(A) NAME/KEY: 3' UTR
(B) LOCATION: 11444..11464
(C) IDENTIFICATION METHOD: E
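The feature locations listed above can be cross-checked against the 471 base pair cDNAs and the 157 amino acid peptides elsewhere in this listing. The following is a minimal sketch in Python, not part of the original listing; the names `FEATURES` and `total_length` are illustrative assumptions, and the coordinates are taken as 1-based and inclusive, as the table suggests.

```python
from collections import defaultdict

# (feature key, first base, last base), copied from the feature table above.
FEATURES = [
    ("5' UTR", 1, 3),
    ("leader peptide", 4, 82),
    ("intron", 83, 1453),
    ("leader peptide", 1454, 1465),
    ("intron", 1466, 4848),
    ("leader peptide", 4849, 4865),
    ("mat peptide", 4866, 4983),
    ("intron", 4984, 6317),
    ("mat peptide", 6318, 6451),
    ("intron", 6452, 11224),
    ("mat peptide", 11225, 11443),
    ("3' UTR", 11444, 11464),
]


def total_length(features):
    """Sum the lengths of all segments that share a feature key."""
    totals = defaultdict(int)
    for name, start, end in features:
        totals[name] += end - start + 1  # both end points are counted
    return dict(totals)


totals = total_length(FEATURES)
# Coding totals in bases, and the same totals in codons.
print(totals["mat peptide"], totals["mat peptide"] // 3)
print(totals["leader peptide"], totals["leader peptide"] // 3)
# The segments should tile the whole genomic sequence.
print(sum(totals.values()))
```

Under these assumptions the three mat peptide segments add up to 471 bases (157 codons), the three leader peptide segments to 108 bases (36 codons), and all segments together to the stated length of 11464 base pairs.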



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

AAG ATG GCT GCT GAA CCA GTA GAA GAC AAT TGC ATC AAC TTT GTG GCA  48
    Met Ala Ala Glu Pro Val Glu Asp Asn Cys Ile Asn Phe Val Ala
        -35                 -30                 -25

ATG AAA TTT ATT GAC AAT ACG CTT TAC TTT ATA G GTAAGG CTAATGCCAT  98
Met Lys Phe Ile Asp Asn Thr Leu Tyr Phe Ile Ala
    -20                 -15                 -10

AGAACAAATA CCAGGTTCAG ATAAATCTAT TCAATTAGAA AAGATGTTGT GAGGTGAACT  158
ATTAAGTGAC TCTTTGTGTC ACCAAATTTC ACTGTAATAT TAATGGCTCT TAAAAAAATA  218
GTGGACCTCT AGAAATTAAC CACAACATGT CCAAGGTCTC AGCACCTTGT CACACCACGT  278
GTCCTGGCAC TTTAATCAGC AGTAGCTCAC TCTCCAGTTG GCAGTAAGTG CACATCATGA  338
AAATCCCAGT TTTCATGGGA AAATCCCAGT TTTCATTGGA TTTCCATGGG AAAAATCCCA  398
GTACAAAACT GGGTGCATTC AGGAAATACA ATTTCCCAAA GCAAATTGGC AAATTATGTA  458
AGAGATTCTC TAAATTTAGA GTTCCGTGAA TTACACCATT TTATGTAAAT ATGTTTGACA  518
AGTAAAAATT GATTCTTTTT TTTTTTTTCT GTTGCCCAGG CTGGAGTGCA GTGGCACAAT  578
CTCTGCTCAC TGCAACCTCC ACCTCCTGGG TTCAAGCAAT TCTCCTGCCT CAGCCTTCTG  638
AGTAGCTGGG ACTACAGGTG CATCCCGCCA TGCCTGGCTA ATTTTTGGGT ATTTTTACTA  698
GAGACAGGGT TTTGGCATGT TGTCCAGGCT GGTCTTGGAC TCCTGATCTC AGATGATCCT  758
CCTGGCTCGG GCTCCCAAAG TGCTGGGATT ACAGGCATGA ACCACCACAC ATGGCCTAAA  818
AATTGATTCT TATGATTAAT CTCCTGTGAA CAATTTGGCT TCATTTGAAA GTTTGCCTTC  878
ATTTGAAACC TTCATTTAAA AGCCTGAGCA ACAAAGTGAG ACCCCATCTC TACAAAAAAC  938
TGCAAAATAT CCTGTGGACA CCTCCTACCT TCTGTGGAGG CTGAAGCAGG AGGATCACTT  998
GAGCCTAGGA ATTTGAGCCT GCAGTGAGCT ATGATCCCAC CCCTACACTC CAGCCTGCAT  1058
GACAGTAGAC CCTGACACAC ACACACAAAA AAAAACCTTC ATAAAAAATT ATTAGTTGAC  1118
TTTTCTTAGG TGACTTTCCG TTTAAGCAAT AAATTTAAAA GTAAAATCTC TAATTTTAGA  1178
AAATTTATTT TTAGTTACAT ATTGAAATTT TTAAACCCTA GGTTTAAGTT TTATGTCTAA  1238
ATTACCTGAG AACACACTAA GTCTGATAAG CTTCATTTTA TGGGCCTTTT GGATGATTAT  1298
ATAATATTCT GATGAAAGCC AAGACAGACC CTTAAACCAT AAAAATAGGA GTTCGAGAAA  1358
GAGGAGTAGC AAAAGTAAAA GCTAGAATGA GATTGAATTC TGAGTCGAAA TACAAAATTT  1418

TACATATTCT GTTTCTCTCT TTTTCCCCCT CTTAG CT GAA GAT GAT G GTAAA  1470
                                      Ala Glu Asp Asp Glu
                                      -10

GTAGAAATGA ATTTATTTTT CTTTGCAAAC TAAGTATCTG CTTGAGACAC ATCTATCTCA  1530
CCATTGTCAG CTGAGGAAAA AAAAAAATGG TTCTCATGCT ACCAATCTGC CTTCAAAGAA  1590
ATGTGGACTC AGTAGCACAG CTTTGGAATG AAGATGATCA TAAGAGATAC AAAGAAGAAC  1650
CTCTAGCAAA AGATGCTTCT CTATGCCTTA AAAAATTCTC CAGCTCTTAG AATCTACAAA  1710
ATAGACTTTG CCTGTTTCAT TGGTCCTAAG ATTAGCATGA AGCCATGGAT TCTGTTGTAG  1770
GGGGAGCGTT GCATAGGAAA AAGGGATTGA AGCATTAGAA TTGTCCAAAA TCAGTAACAC  1830
CTCCTCTCAG AAATGCTTTG GGAAGAAGCC TGGAAGGTTC CGGGTTGGTG GTGGGGTGGG  1890
GCAGAAAATT CTGGAAGTAG AGGAGATAGG AATGGGTGGG GCAAGAAGAC CACATTCAGA  1950
GGCCAAAAGC TGAAAGAAAC CATGGCATTT ATGATGAATT CAGGGTAATT CAGAATGGAA  2010
GTAGAGTAGG AGTAGGAGAC TGGTGAGAGG AGCTAGAGTG ATAAACAGGG TGTAGAGCAA  2070
GACGTTCTCT CACCCCAAGA TGTGAAATTT GGACTTTATC TTGGAGATAA TAGGGTTAAT  2130
TAAGCACAAT ATGTATTAGC TAGGGTAAAG ATTAGTTTGT TGTAACAAAG ACATCCAAAG  2190
ATACAGTAGC TGAATAAGAT AGAGAATTTT TCTCTCAAAG AAAGTCTAAG TAGGCAGCTC  2250
AGAAGTAGTA TGGCTGGAAG CAACCTGATG ATATTGGGAC CCCCAACCTT CTTCAGTCTT  2310
GTACCCATCA TCCCCTAGTT GTTGATCTCA CTCACATAGT TGAAAATCAT CATACTTCCT  2370
GGGTTCATAT CCCAGTTATC AAGAAAGGGT CAAGAGAAGT CAGGCTCATT CCTTTCAAAG  2430
ACTCTAATTG GAAGTTAAAC ACATCAATCC CCCTCATATT CCATTGACTA GAATTTAATC  2490
ACATGGCCAC ACCAAGTGCA AGGAAATCTG GAAAATATAA TCTTTATTCC AGGTAGCCAT  2550
ATGACTCTTT AAAATTCAGA AATAATATAT TTTTAAAATA TCATTCTGGC TTTGGTATAA  2610
AGAATTGATG GTGTGGGGTG AGGAGGCCAA AATTAAGGGT TGAGAGCCTA TTATTTTAGT  2670
TATTACAAGA AATGATGGTG TCATGAATTA AGGTAGACAT AGGGGAGTGC TGATGAGGAG  2730
CTGTGAATGG ATTTTAGAAA CACTTGAGAG AATCAATAGG ACATGATTTA GGGTTGGATT  2790
TGGAAAGGAG AAGAAAGTAG AAAAGATGAT GCCTACATTT TTCACTTAGG CAATTTGTAC  2850
CATTCAGTGA AATAGGGAAC ACAGGAGGAA GAGCAGGTTT TGGTGTATAC AAAGAGGAGG  2910
ATGGATGACG CATTTCGTTT TGGATCTGAG ATGTCTGTGG AACGTCCTAG TGGAGATGTC  2970
CACAAACTCT TCTACATGTG GTTCTGAGTT CAGGACACAG ATTTGGGCTG GAGATAGAGA  3030
TATTGTAGGC TTATACATAG AAATGGCATT TGAATCTATA GAGATAAAAA GACACATCAG  3090
AGGAAATGTG TAAAGTGAGA GAGGAAAAGC CAAGTACTGT GCTGGGGGGA ATACCTACAT  3150
TTAAAGGATG CAGTAGAAAG AAGCTAATAA ACAACAGAGA GCAGACTAAC CAAAAGGGGA  3210
GAAGAAAAAC CAAGAGAATT CCACCGACTC CCAGGAGAGC ATTTCAAGAT TGAGGGGATA  3270
GGTGTTGTGT TGAATTTTGC AGCCTTGAGA ATCAAGGGCC AGAACACAGC TTTTAGATTT  3330
AGCAACAAGG AGTTTGGTGA TCTCAGTGAA AGCAGCTTGA TGGTGAAATG GAGGCAGAGG  3390
CAGATTGCAA TGAGTGAAAC AGTGAATGGG AAGTGAAGAA ATGATACAGA TAATTCTTGC  3450
TAAAAGCTTG GCTGTTAAAA GGAGGAGAGA AACAAGACTA GCTGCAAAGT GAGATTGGGT  3510
TGATGGAGCA GTTTTAAATC TCAAAATAAA GAGCTTTGTG CTTTTTTGAT TATGAAAATA  3570
ATGTGTTAAT TGTAACTAAT TGAGGCAATG AAAAAAGATA ATAATATGAA AGATAAAAAT  3630
ATAAAAACCA CCCAGAAATA ATGATAGCTA CCATTTTGAT ACAATATTTC TACACTCCTT  3690
TCTATGTATA TATACAGACA CAGAAATGCT TATATTTTTA TTAAAAGGGA TTGTACTATA  3750
CCTAAGCTGC TTTTTCTAGT TAGTGATATA TATGGACATC TCTCCATGGC AACGAGTAAT  3810
TGCAGTTATA TTAAGTTCAT GATATTTCAC AATAAGGGCA TATCTTTGCC CTTTTTATTT  3870
AATCAATTCT TAATTGGTGA ATGTTTGTTT CCAGTTTGTT GTTGTTATTA ACAATGTTCC  3930
CATAAGCATT CCTGTACACC AATGTTCACA CATTTGTCTG ATTTTTTCTT CAGGATAAAA  3990
CCCAGGAGGx AbAAilbLiG ^IIGATAGA AGAGAAAGGA TGATTGCCAA ATTAAAGCTT  4050
CAGTAGAGGG TACATGCCGA GCACAAATGG GATCAGCCCT AGATACCAGA AATGGCACTT  4110
TCTCATTTCC CCTTGGGACA AAAGGGAGAG AGGCAATAAC TGTGCTGCCA GAGTTAAATT  4170
TGTACGTGGA GTAGCAGGAA ATCATTTGCT GAAAATGAAA ACAGAGATGA TGTTGTAGAG  4230
GTCCTGAAGA GAGCAAAGAA AATTTGAAAT TGCGGCTATC AGCTATGGAA GAGAGTGCTG  4290
AACTGGAAAA CAAAAGAAGT ATTGACAATT GGTATGCTTG TAATGGCACC GATTTGAACG  4350
CTTGTGCCAT TGTTCACCAG CAGCACTCAG CAGCCAAGTT TGGAGTTTTG TAGCAGAAAG  4410
ACAAATAAGT TAGGGATTTA ATATCCTGGC CAAATGGTAG ACAAAATGAA CTCTGAGATC  4470
CAGCTGCACA GGGAAGGAAG GGAAGACGGG AAGAGGTTAG ATAGGAAATA CAAGAGTGAG  4530
GAGACTGGAA GATGTTGTGA TATTTAAGAA CACATAGAGT TGGAGTAAAA GTGTAAGAAA  4590
ACTAGAAGGG TAAGAGACCG GTCAGAAAGT AGGCTATTTG AAGTTAACAC TTCAGAGGCA  4650
GAGTAGTTCT GAATGGTAAC AAGAAATTGA GTGTGCCTTT GAGAGTAGGT TAAAAAACAA  4710
TAGGCAACTT TATTGTAGCT ACTTCTGGAA CAGAAGATTG TCATTAATAG TTTTAGAAAA  4770
CTAAAATATA TAGCATACTT ATTTGTCAAT TAACAAAGAA ACTATGTATT TTTAAATGAG  4830

ATTTAATGTT TATTGTAG AA AAC CTG GAA TCA GAT TAC TTT GGC AAG CTT  4880
                   Glu Asn Leu Glu Ser Asp Tyr Phe Gly Lys Leu
                       -5                  1               5

50 GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG AAT GAC CAA GTT CTC TTC 4 92 8 
Glu Ser Lys Leu Ser Val He Arg Asn Leu Asn Asp Gin Val Leu Phe 

10 15 20 

ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT ATG ACT GAT TCT GAC 4 976 
He Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp Met Thr Asp Ser Asp 

55 25 30 35 

TGT AGA G GTATTTTTT TTAATTCGCA AACATAGAAA TG ACT AG CTA CTTCTTCCCA 503 2 
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20 



Cys Arg Asp 
40 

TTCTGTTTTA CTGCTTACAT TGTTCCGTGC TAGTCCCAAT CCTCAGATGA AAAGTCACAG  5092
GAGTGACAAT AATTTCACTT ACAGGAAACT TTATAAGGCA TCCACGTTTT TTAGTTGGGG  5152
TAAAAAATTG GATACAATAA GACATTGCTA GGGGTCATGC CTCTCTGAGC CTGCCTTTGA  5212
ATCACCAATC CCTTTATTGT GATTGCATTA ACTGTTTAAA ACCTCTATAG TTGGATGCTT  5272
AATCCCTGCT TGTTACAGCT GAAAATGCTG ATAGTTTACC AGGTGTGGTG GCATCTATCT  5332
GTAATCCTAG CTACTTGGGA GGCTCAAGCA GGAGGATTGC TTGAGGCCAG GACTTTGAGG  5392
CTGTAGTACA CTGTGATCGT ACCTGTGAAT AGCCACTGCA CTCCAGCCTG GGTGATATAC  5452
AGACCTTGTC TCTAAAATTA AAAAAAAAAA AAAAAAAAAC CTTAGGAAAG GAAATTGATC  5512
AAGTCTACTG TGCCTTCCAA AACATGAATT CCAAATATCA AAGTTAGGCT GAGTTGAAGC  5572
AGTGAATGTG CATTCTTTAA AAATACTGAA TACTTACCTT AACATATATT TTAAATATTT  5632
TATTTAGCAT TTAAAAGTTA AAAACAATCT TTTAGAATTC ATATCTTTAA AATACTCAAA  5692
AAAGTTGCAG CGTGTGTGTT GTAATACACA TTAAACTGTG GGGTTGTTTG TTTGTTTGAG  5752
ATGCAGTTTC ACTCTGTCAC CCAGGCTGAA GTGCAGTGCA GTGCAGTGGT GTGATCTCGG  5812
CTCACTACAA CCTCCACCTC CCACGTTCAA GCGATTCTCA TGCCTCAGTC TCCCGAGTAG  5872
GTGGGATTAC AGGCATGCAC CACTTACACC CGGCTAATTT TTGTATTTTT AGTAGAGCTG  5932
GGGTTTCACC ATGTTGGCCA GGCTGGTCTC AAACCCCTAA CCTCAAGTGA TCTGCCTGCC  5992
TCAGCCTCCC AAACAAACAA ACAACCCCAC AGTTTAATAT GTGTTACAAC ACACATGCTG  6052
CAACTTTTAT GAGTATTTTA ATGATATAGA TTATAAAAGG TTGTTTTTAA CTTTTAAATG  6112
CTGGGATTAC AGGCATGAGC CACTGTGCCA GGCCTGAACT GTGTTTTTAA AAATGTCTGA  6172
CCAGCTGTAC ATAGTCTCCT GCAGACTGGC CAAGTCTCAA AGTGGGAACA GGTGTATTAA  6232
GGACTATCCT TTGGTTAAAT TTCCGCAAAT GTTCCTGTGC AAGAATTCTT CTAACTAGAG  6292

TTCTCATTTA TTATATTTAT TTCAG AT AAT GCA CCC CGG ACC ATA TTT ATT  6343
                           Asp Asn Ala Pro Arg Thr Ile Phe Ile
                            40                  45

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC  6391
Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

TCT GTG AAG TGT GAG AAA ATT TCA ACT CTC TCC TGT GAG AAC AAA ATT  6439
Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
 65                  70                  75                  80

ATT TCC TTT AAG GTAAG ACTGAGCCTT ACTTTGTTTT CAATCATGTT AATATAATCA  6496
Ile Ser Phe Lys

ATATAATTAG AAATATAACA TTATTTCTAA TGTTAATATA AGTAATGTAA TTAGAAAACT  6556
CAAATATCCT CAGACCAACC TTTTGTCTAG AACAGAAATA ACAAGAAGCA GAGAACCATT  6616
AAAGTGAATA CTTACTAAAA ATTATCAAAC TCTTTACCTA TTGTGATAAT GATGGTTTTT  6676
CTGAGCCTGT CACAGGGGAA GAGGAGATAC AACACTTGTT TTATGACCTG CATCTCCTGA  6736
ACAATCAGTC TTTATACAAA TAATAATGTA GAATACATAT GTGAGTTATA CATTTAAGAA  6796
TAACATGTGA CTTTCCAGAA TGAGTTCTGC TATGAAGAAT GAAGCTAATT ATCCTTCTAT  6856
ATTTCTACAC CTTTGTAAAT TATGATAATA TTTTAATCCC TAGTTGTTTT GTTGCTGATC  6916
CTTAGCCTAA GTCTTAGACA CAAGCTTCAG CTTCCAGTTG ATGTATGTTA TTTTTAATGT  6976
TAATCTAATT GAATAAAAGT TATGAGATCA GCTGTAAAAG TAATGCTATA ATTATCTTCA  7036
AGCCAGGTAT AAAGTATTTC TGGCCTCTAC TTTTTCTCTA TTATTCTCCA TTATTATTCT  7096
CTATTATTTT TCTCTATTTC CTCCATTATT GTTAGATAAA CCACAATTAA CTATAGCTAC  7156
AGACTGAGCC AGTAAGAGTA GCCAGGGATG CTTACAAATT GGCAATGCTT CAGAGGAGAA  7216
TTCCATGTCA TGAAGACTCT TTTTGAGTGG AGATTTGCCA ATAAATATCC GCTTTCATGC  7276
CCACCCAGTC CCCACTGAAA GACAGTTAGG ATATGACCTT AGTGAAGGTA CCAAGGGGCA  7336
ACTTGGTAGG GAGAAAAAAG CCACTCTAAA ATATAATCCA AGTAAGAACA GTGCATATGC  7396
AACAGATACA GCCCCCAGAC AAATCCCTCA GCTATCTCCC TCCAACCAGA GTGCCACCCC  7456
TTCAGGTGAC AATTTGGAGT CCCCATTCTA GACCTGACAG GCAGCTTAGT TATCAAAATA  7516
GCATAAGAGG CCTGGGATGG AAGGGTAGGG TGGAAAGGGT TAAGCATGCT GTTACTGAAC  7576
AACATAATTA GAAGGGAAGG AGATGGCCAA GCTCAAGCTA TGTGGGATAG AGGAAAACTC  7636
AGCTGCAGAG GCAGATTCAG AAACTGGGAT AAGTCCGAAC CTACAGGTGG ATTCTTGTTG  7696
AGGGAGACTG GTGAAAATGT TAAGAAGATG GAAATAATGC TTGGCACTTA GTAGGAACTG  7756
GGCAAATCCA TATTTGGGGG AGCCTGAAGT TTATTCAATT TTGATGGCCC TTTTAAATAA  7816
AAAGAATGTG GCTGGGCGTG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGGCCGAG  7876
GGGGGCGGAT CACCTGAAGT CAGGAGTTCA AGACCAGCCT GACCAACATG GAGAAACCCC  7936
ATCTCTACTA AAAATACAAA ATTAGCTGGG CGTGGTGGCA TATGCCTGTA ATCCCAGCTA  7996
CTCGGGAGGC TGAGGCAGGA GAATCTTTTG AACCCGGGAG GCAGAGGTTG CGATGAGCCT  8056
AGATCGTGCC ATTGCACTCC AGCCTGGGCA ACAAGAGCAA AACTCGGTCT CAAAAAAAAA  8116
AAAAAAAAAG TGAAATTAAC CAAAGGCATT AGCTTAATAA TTTAATACTG TTTTTAAGTA  8176

GGGCGGGGGG TGGCTGGAAG AGATCTGTGT AAATGAGGGA ATCTGACATT TAAGCTTCAT  8236
CAGCATCATA GCAAATCTGC TTCTGGAAGG AACTCAATAA ATATTAGTTG GAGGGGGGGA  8296
GAGAGTGAGG GGTGGACTAG GACCAGTTTT AGCCCTTGTC TTTAATCCCT TTTCCTGCCA  8356
CTAATAAGGA TCTTAGCAGT GGTTATAAAA GTGGCCTAGG TTCTAGATAA TAAGATACAA  8416
CAGGCCAGGC ACAGTGGCTC ATGCCTATAA TCCCAGCACT TTGGGAGGGC AAGGCGAGTG  8476
TCTCACTTGA GATCAGGAGT TCAAGACCAG CCTGGCCAGC ATGGCGATAC TCTGTCTCTA  8536
CTAAAAAAAA TACAAAAATT AGCCAGGCAT GGTGGCATGC ACCTGTAATC CCAGCTACTC  8596
GTGAGCCTGA GGCAGAAGAA TCGCTTGAAA CCAGGAGGTG TAGGCTGCAG TGAGCTGAGA  8656
TCGCACCACT GCACTCCAGC CTGGGCGACA GAATGAGACT TTGTCTCAAA AAAAGAAAAA  8716
GATACAACAG GCTACCCTTA TGTGCTCACC TTTCACTGTT GATTACTAGC TATAAAGTCC  8776
TATAAAGTTC TTTGGTCAAG AACCTTGACA ACACTAAGAG GGATTTGCTT TGAGAGGTTA  8836
CTGTCAGAGT CTGTTTCATA TATATACATA TACATGTATA TATGTATCTA TATCCAGGCT  8896
TGGCCAGGGT TCCCTCAGAC TTTCCAGTGC ACTTGGGAGA TGTTAGGTCA ATATCAACTT  8956
TCCCTGGATT CAGATTCAAC CCCTTCTGAT GTAAAAAAAA AAAAAAAAAA GAAAGAAATC  9016
CCTTTCCCCT TGGAGCACTC AAGTTTCACC AGGTGGGGCT TTCCAAGTTG GGGGTTCTCC  9076
AAGGTCATTG GGATTGCTTT CACATCCATT TGCTATGTAC CTTCCCTATG ATGGCTGGGA  9136
GTGGTCAACA TCAAAACTAG GAAAGCTACT GCCCAAGGAT GTCCTTACCT CTATTCTGAA  9196
ATGTGCAATA AGTGTGATTA AAGAGATTGC CTGTTCTACC TATCCACACT CTCGCTTTCA  9256
ACTGTAACTT TCTTTTTTTC TTTTTTTCTT TTTTTCTTTT TTTTTGAAAC GGAGTCTCGC  9316
TCTGTCGCCC AGGCTAGAGT GCAGTGGCAC GATCTCAGCT CACTGCAAGC TCTGCCTCCC  9376
GGGTTCACGC CATTCTCCTG CCTCACCCTC CCAAGCAGCT GGGACTACAG GCGCCTGCCA  9436
CCATGCCCAG CTAATTTTTT GTATTTTTAG TAGAGACGGG GTTTCACCGT GTTAGCCAGG  9496
ATGGTCTCGA TCTCCTGAAC TTGTGATCCG CCCGCCTCAG CCTCCCAAAG TGCTGGGATT  9556
ACAGGCGTGA GCCATCGCAC CCGGCTCAAC TGTAACTTTC TATACTGGTT CATCTTCCCC  9616
TGTAATGTTA CTAGAGCTTT TGAAGTTTTG GCTATGGATT ATTTCTCATT TATACATTAG  9676
ATTTCAGATT AGTTCCAAAT TGATGCCCAC AGCTTAGGGT CTCTTCCTAA ATTGTATATT  9736
GTAGACAGCT GCAGAAGTGG GTGCCAATAG GGGAACTAGT TTATACTTTC ATCAACTTAG  9796
GACCCACACT TGTTGATAAA GAACAAAGGT CAAGAGTTAT GACTACTGAT TCCACAACTG  9856
ATTGAGAAGT TGGAGATAAC CCCGTGACCT CTGCCATCCA GAGTCTTTCA GGCATCTTTG  9916
AAGGATGAAG AAATGCTATT TTAATTTTGG AGGTTTCTCT ATCAGTGCTT AGGATCATGG  9976
GAATCTGTGC TGCCATGAGG CCAAAATTAA GTCCAAAACA TCTACTGGTT CCAGGATTAA  10036
CATGGAAGAA CCTTAGGTGG TGCCCACATG TTCTGATCCA TCCTGCAAAA TAGACATGCT  10096
GCACTAACAG GAAAAGTGCA GGCAGCACTA CCAGTTGGAT AACCTGCAAG ATTATAGTTT  10156
CAAGTAATCT AACCATTTCT CACAAGGCCC TATTCTGTGA CTGAAACATA CAAGAATCTG  10216
CATTTGGCCT TCTAAGGCAG GGCCCAGCCA AGGAGACCAT ATTCAGGACA GAAATTCAAG  10276
ACTACTATGG AACTGGAGTG CTTGGCAGGG AAGACAGAGT CAAGGACTGC CAACTGAGCC  10336
AATACAGCAG GCTTACACAG GAACCCAGGG CCTAGCCCTA CAACAATTAT TGGGTCTATT  10396
CACTGTAAGT TTTAATTTCA GGCTCCACTG AAAGAGTAAG CTAAGATTCC TGGCACTTTC  10456
TGTCTCTCTC ACAGTTGGCT CAGAAATGAG AACTGGTCAG GCCAGGCATG GTGGCTTACA  10516
CCTGGAATCC CAGCACTTTG GGAGGCCGAA GTGGGAGGGT CACTTGAGGC CAGGAGTTCA  10576
GGACCAGCTT AGGCAACAAA GTGAGATACC CCCTGACCCC TTCTCTACAA AAATAAATTT  10636
TAAAAATTAG CCAAATGTGG TGGTGTATAC TTACAGTCCC AGCTACTCAG GAGGCTGAGG  10696
CAGGGGGATT GCTTGAGCCC AGGAATTCAA GGCTGCAGTG AGCTATGATT TCACCACTGC  10756
ACTTCTGGCT GGGCAACAGA GCGAGACCCT GTCTCAAAGC AAAAAGAAAA AGAAACTAGA  10816
ACTAGCCTAA GTTTGTGGGA GGAGGTCATC ATCGTCTTTA GCCGTGAATG GTTATTATAG  10876
AGGACAGAAA TTGACATTAG CCCAAAAAGC TTGTGGTCTT TGCTGGAACT CTACTTAATC  10936
TTGAGCAAAT GTGGACACCA CTCAATGGGA GAGGAGAGAA GTAAGCTGTT TGATGTATAG  10996
GGGAAAACTA GAGGCCTGGA ACTGAATATG CATCCCATGA CAGGGAGAAT AGGAGATTCG  11056
GAGTTAAGAA GGAGAGGAGG TCAGTACTGC TGTTCAGAGA TTTTTTTTAT GTAACTCTTG  11116
AGAAGCAAAA CTACTTTTGT TCTGTTTGGT AATATACTTC AAAACAAACT TCATATATTC  11176

AAATTGTTCA TGTCCTGAAA TAATTAGGTA ATGTTTTTTT CTCTATAG GAA ATG AAT  11233
                                                     Glu Met Asn
                                                      85

CCT CCT GAT AAC ATC AAG GAT ACA AAA AGT GAC ATC ATA TTC TTT CAG  11281
Pro Pro Asp Asn Ile Lys Asp Thr Lys Ser Asp Ile Ile Phe Phe Gln
         90                  95                 100

AGA AGT GTC CCA GGA CAT GAT AAT AAG ATG CAA TTT GAA TCT TCA TCA  11329
Arg Ser Val Pro Gly His Asp Asn Lys Met Gln Phe Glu Ser Ser Ser
    105                 110                 115

TAC GAA GGA TAC TTT CTA GCT TGT GAA AAA GAG AGA GAC CTT TTT AAA  11377
Tyr Glu Gly Tyr Phe Leu Ala Cys Glu Lys Glu Arg Asp Leu Phe Lys
120                 125                 130                 135

CTC ATT TTG AAA AAA GAG
Leu Ile Leu Lys Lys Glu
                140

ACT GTT CAA AAC GAA GAC
Thr Val Gln Asn Glu Asp
            155

(19) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: mouse
(G) CELL TYPE: liver

(ix) FEATURE:

(A) NAME/KEY: mat peptide
(B) LOCATION: 1..471
(C) IDENTIFICATION METHOD: S



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

AAC TTT GGC CGA CTT CAC TGT ACA ACC GCA GTA ATA CGG AAT ATA AAT
Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn
  1               5                  10                  15

GAC CAA GTT CTC TTC GTT GAC AAA AGA CAG CCT GTG TTC GAG GAT ATG
Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
             20                  25                  30

ACT GAT ATT GAT CAA AGT GCC AGT GAA CCC CAG ACC AGA CTG ATA ATA
Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
         35                  40                  45

TAC ATG TAC AAA GAC AGT GAA GTA AGA GGA CTG GCT GTG ACC CTC TCT
Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
     50                  55                  60

GTG AAG GAT AGT AAA ATG TCT ACC CTC TCC TGT AAG AAC AAG ATC ATT
Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
 65                  70                  75                  80

TCC TTT GAG GAA ATG GAT CCA CCT GAA AAT ATT GAT GAT ATA CAA AGT
Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                 85                  90                  95

GAT CTC ATA TTC TTT CAG AAA CGT GTT CCA GGA CAC AAC AAG ATG GAG
Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110

TTT GAA TCT TCA CTG TAT GAA GGA CAC TTT CTT GCT TGC CAA AAG GAA
Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Cys Gln Lys Glu
        115                 120                 125

GAT GAT GCT TTC AAA CTC ATT CTG AAA AAA AAG GAT GAA AAT GGG GAT
Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140

AAA TCT GTA ATG TTC ACT CTC ACT AAC TTA CAT CAA AGT
Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155

(20) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 



GAT GAA TTG GGG GAT AGA TCT ATA ATG TTC
Asp Glu Leu Gly Asp Arg Ser Ile Met Phe
            145                 150

TAGCTATTAA AATTTCATGC C
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(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: N-terminal fragment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Asn Phe Gly Arg Leu His Cys Thr Thr
  1               5

(21) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155



(22) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155
(23) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(24) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(25) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ser Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(26) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(27) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
             20                  25                  30

Met Thr Asp Ser Asp Ser Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
         35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
     50                  55                  60

Ser Val Lys Ser Glu Lys Ile Ser Thr Leu Ser Ala Glu Asn Lys Ile
 65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                 85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Ser Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155

(28) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Asn Phe Gly Arg Leu His Ala Thr Thr Ala Val Ile Arg Asn Ile Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
             20                  25                  30

Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
         35                  40                  45

Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
     50                  55                  60

Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
 65                  70                  75                  80

Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                 85                  90                  95

Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110

Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Cys Gln Lys Glu
        115                 120                 125

Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140

Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155

(29) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Asn Phe Gly Arg Leu His Cys Thr Thr Ala Val Ile Arg Asn Ile Asn
  1               5                  10                  15

Asp Gln Val Leu Phe Val Asp Lys Arg Gln Pro Val Phe Glu Asp Met
             20                  25                  30

Thr Asp Ile Asp Gln Ser Ala Ser Glu Pro Gln Thr Arg Leu Ile Ile
         35                  40                  45

Tyr Met Tyr Lys Asp Ser Glu Val Arg Gly Leu Ala Val Thr Leu Ser
     50                  55                  60

Val Lys Asp Ser Lys Met Ser Thr Leu Ser Cys Lys Asn Lys Ile Ile
 65                  70                  75                  80

Ser Phe Glu Glu Met Asp Pro Pro Glu Asn Ile Asp Asp Ile Gln Ser
                 85                  90                  95

Asp Leu Ile Phe Phe Gln Lys Arg Val Pro Gly His Asn Lys Met Glu
            100                 105                 110

Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Ser Gln Lys Glu
        115                 120                 125

Asp Asp Ala Phe Lys Leu Ile Leu Lys Lys Lys Asp Glu Asn Gly Asp
    130                 135                 140

Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gln Ser
145                 150                 155

Claims

1. An osteoclastgenic inhibitory agent, which comprises an interleukin-18 or its functional equivalent.

2. The inhibitory agent of claim 1, wherein said interleukin-18 includes the amino acid sequences of SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3 as partial amino acid sequences.

3. The inhibitory agent of claim 1, wherein said interleukin-18 includes the amino acid sequences of SEQ ID NO: 4 and SEQ ID NO: 5 as partial amino acid sequences.

4. The inhibitory agent of claim 1, wherein said interleukin-18 includes the amino acid sequence of SEQ ID NO: 6.

5. The inhibitory agent of claim 1, wherein said interleukin-18 is of human origin.

6. The inhibitory agent of claim 1, wherein said interleukin-18 includes the amino acid sequence of SEQ ID NO: 7.

7. The inhibitory agent of claim 1, which is a therapeutic agent for osteoclast-related diseases.

8. The inhibitory agent of claim 1, which contains a protein, buffer or saccharide as a stabilizer.

9. The inhibitory agent of claim 1, which is in the form of a liquid, paste, or solid.

10. The inhibitory agent of claim 1, which contains 0.000002-100 w/w % of said interleukin-18.

11. An inhibitory agent as defined in any preceding claim, for use as a pharmaceutical.

12. Use of an inhibitory agent as defined in any of claims 1-10 for the preparation of a medicament effective for treating
and/or preventing osteoclast-related diseases.

[FIG. 2: figure labels EcoRI, SphI, XhoI 2218]

[FIG. 3: figure label XhoI 2218]

[FIG. 4: figure labels Hind III, PCMV, HuIGIF]
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